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FIELD AND BACKGROUND OF THE INVENTION 

The present invention relates to a method of identifying peptides of a 

desired origin and which are capable of binding to MHC molecules of a 

particular haplotype; peptides identified by the method; pharmaceutical 
25 compositions containing the peptides, databases describing the peptides and 

the use of the peptides in vaccination . 

The following abbreviations are used herein: MHC, Major 

Histocompatibility Complex; (J2m 9 ($2-microglobulin; ESI, electrospray 

ionization; MS, mass spectrometry; m/z, mass chargei ratio; CID, collision 
30 induced disintegration; MS/MS, tandem mass spectrometry; MTDM, DNA 

methyl transferase; FAS, fatty acid synthase; CTL ? cytotoxic T lymphocytes; 

mAbs ? monoclonal antibodies. 
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The MHC serves as a shuttle ,o transport and display pepdde ffiligens 
on the surface of cells as an indication ,o the immune ,ys,e m of a* health s«e 
of A. cells. Each Individ,*, has at most six differ, MHC c,ass-I haplotypes 
ou, of the hundreds known. MHC bound peptides, i.e, peptides bound ,„, and 
» P"*med in context „f, MHC molecu.es, originate from pro,e„ lysis rf- of 
the proteins expressed in the ceils. Therefore, unique sets of MHC bound 
peptides are displayed by each of the different MHC haplotypes according to 
the protein expression and degradation schemes of «he ceils and according ,„ 
p the peptide binding motifs of the MHC molecules (reviewed in [1]). 

10 Therefore, mousands of different peptides are presented by the different MHC 
dass-1 hapiorypes and each of the peptides is presented in vastly differing 
copies per cell [2]. When cells become infected, some of the presented 
peptides are derived from the pathogen's proteins, and so indicate to 
circulating T-cells to kill the diseased cells and prevent the spread of the 
15 disease. 

Each MHC haplotype recognizes the peptides through a broadly defined 
consensus motif of peptide's amino acids strategically positioned to serve as 
anchors to the appropriate binding pockets on the MHC molecule. The 
binding motifs of many of the MHCs haplolypes were first established by pool 
20 Edman sequencing of unfractionated peptide mixtures eluted from 
immunoaffinity purified MHC molecules [3, 4]. The consensus was further 
extended by direct biochemical analysis of individual peptides separated by 
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reversed phase chromatography and analyzed by tandem mass spectrometry [2, 
5, 6], reviewed in [7], 

MHC bound peptides derived from cancer specific or associated 
proteins or antigens were extensively searched for, with the goal of finding 
5 among them peptide candidates for development of anti-cancer vaccines. A 
number of such tumor specific peptides were already identified and some were 
successfully tested as anti-cancer vaccines for human treatment, most notably 
for immunotherapy of melanoma [8, 9]. Three main approaches were 
extensively used for the identification of such MHC bound peptides [10]. The 
10 genetic approach involves transfection of cDNA libraries, made from tumor 
cells, into cells that present the MHC allele of interest. The clones of 
£ transfected cells that stimulated CTL lines against the tumor cells were 

selected as the source for the tumor antigen and the genes were further 
fragmented to isolate the regions of the genes that encode the particular 
15 immunogenic peptide [11]. The second approach is based on exploiting the 
known consensus binding motifs of the MHC haplotype of interest to scan 
sequences of known protein "m silico" and to predict putative MHC bound 
peptides that fit this consensus [12], For successful prediction, these 
consensus motifs should be a priori well established, which is not the case for 
20 many of the MHC haplotypes [13]. The drawback of this approach is its 
reliance on chemical synthesis of a large number of peptides, only few of 
which end up being useful The biochemical, third approach, involves the 
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availability of advanced mass spectrometry, needed for analyzing the available 
minute .mounts of peptide, tha, are present a. very complex rotaures 
(reviewed in [7]). 

All these approaches for identifying MHC bound peptides eventually 
rely on chemical synthesis of the peptides of interest to test their capacity to 
bind to the MHC molecule by stabilization of empty MHC molecules on cell 
surface [17], and their potential to elicit an immune response by tetramer 
assays [18], ELISPOT [19] and elicitation CTL responses when presented on 
cells [20]. 

Currently, sequencing and identification of individual MHC bound 
peptides by the direct biochemical approach is most effectively performed by 
use of tandem mass spectrometry. The peptides are resolved by reversed phase 
chromatography and the eluting peptides are collected, assayed for biological 
activity and sequenced, most often by electrospray tandem mass spectrometry 
[2, 5, 21], Comparing the patterns of MHC bound peptides recovered from 
healthy and infected cells helps to identify disease related peptides [22]. Mass 
spectrometry is advantageous for such analysis due to its accuracy, speed of 
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analysis, its ability to analyze complex mixtures of peptides and its high 
sensitivity [7], The biochemical analysis involves the purification of the MHC 
molecules with their bound peptides by immunoaffinity chromatography using 
mAbs specific for the native MHC [2]. To this end, the cells are solubilized 
5 with detergents, the desired MHC molecules are purified with their MHC 
bound peptides and the MHC bound peptides are recovered by denaturation 
and ultra-filtration, However, once the cells are disrupted by the detergents, 
the MHC molecules become contaminated by cellular debris and detergents 
which complicates the subsequent ESI-MS/MS analysis. Moreover, such 
10 immunoaffmity purification of desired MHC haplotypes is possible only when 
specific mAbs are available, whereas for many MHC haplotypes such mAbs 
are presently unavailable. 

There is thus a widely recognized need for, and it would be highly 
W advantageous to have, a method for identifying MHC bound peptides devoid 

lu 
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is of the above limitations. 
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STTMMAR V OF THE INVENTION 

While conceiving the present invention, it was hypothesized that MHC 
20 bound peptides presented within the context of different MHC haplotypes on 
cells of different tissues or tumor origins can be biochemically identified by 
transforming the cells to express and secrete soluble MHC molecules of the 
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different MHC haplotypes, with the aim of biochemically identifying the MHC 
bound peptides that bind to the soluble MHC molecules. Should this approach 
be successful, it solves three major problems associated with the prior art 
biochemical approach. First, although not excluded., there is no need for 
5 specific mAbs per each tipe of MHC, rather general mAbs such as W6/32 (anti 
HLA-A, B and C) can be used to isolate the sMHC and hence the MHC bound 
peptides from the growth medium in which the cells are grown. Second, while 
the prior art approach relies on native MHC molecules, different MHC 
haplotypes directing the expression of different soluble MHC molecules can 

10 potentially be used for each of the cells, to thereby increase the repertoire of 
MHC bound peptides which can be used as, for example, anti-cancer vaccines. 
Third, since the cells are not disrupted and further since there is no use of 
detergents, the sMHC molecules do not become contaminated by cellular 
debris and detergents which otherwise complicates the subsequent 

15 ESI-MS/MS analysis. 

According to one aspect of the present invention there is provided a 
method of identifying peptides originating from a particular cell type and being 
capable of binding to MHC molecules of a particular haplotype, the method 
comprising obtaining a cell type expressing a soluble and secreted form of the 
MHC molecules of the particular haplotype; collecting the soluble and secreted 
form of the MHC molecules of the particular haplotype; and analyzing 
peptides bound to the soluble and secreted form of the MHC molecules of the 
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particular haplotype, thereby identifying the peptides originating from the 
particular cell type and being capable of binding to MHC molecules of the 
particular haplotype. 

According to further features in preferred embodiments of the invention 
described below, the cell type is a cancer cell, 

According to still further features in the described preferred 
embodiments the cell type is a cancer cell line. 

According to still further features in the described preferred 
P embodiments the cell type is a virus infected cell or cell line. 

According to still further features in the described preferred 
embodiments the cell type is a cell involved in a development and/or 
progression of an autoimmune diseases. 

According to another aspect of the present invention there is provided a 
method of identifying peptides originating from at least one protein of interest 
and being capable of binding to MHC molecules of a particular haplotype. the 
method comprising obtaining cells co-expressing the at least one protein of 
interest and a soluble and secreted form of the MHC molecules of the 
particular haplotype; collecting the soluble and secreted form of the MHC 
molecules of the particular haplotype; analyzing peptides bound to the soluble 
and secreted form of the MHC molecules of the particular haplotype; and 
identifying peptides originating from the at least one protein of interest and 
being capable of binding to MHC molecules of the particular haplotype. 
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According to further features in preferred embodiments of the invention 
described below, the protein of interest in natively expressed by the cells. 

According to still further features in the described preferred 
embodiments the at least one protein of interest in expressed by the cells 
following transformation of the cells with nucleic acid encoding for the at least 
one protein of interest. 

According to still further features in the described preferred 
embodiments the at least one protein of interest includes a tumor associated 
antigen. 

According to still further features in the described preferred 
embodiments the at least one protein of interest includes a cytokine. 

According to still further features in the described preferred 
embodiments the at least one protein of interest includes a protein of a 
pathogen. 

According to still further features in the described preferred 
embodiments the soluble and secreted form of the MHC molecules include a 
polypeptide encoded by exons 5 to 8 of a murine mutant QlQb. 

According to still further features in the described preferred 
embodiments analyzing the peptides bound to the soluble and secreted form of 
the MHC molecules of the particular haplotype is by mass spectrometry, mass 
charge ratio and collision induced disintegration. 
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According to still further features in the described preferred 
embodiments identifying peptides originating from the at least one protein of 
interest and being capable of binding to MHC molecules of the particular 
haplotype is by comparison to a protein database. 

According to another aspect of the present invention, there is provided 
an electronic data storage device, storing, in a retrievable form, a plurality of 
sequences of peptides identified by the methods described herein. 

According to still another aspect of the present invention, there is 
provided a kit comprising a plurality of individual containers, each of the 
plurality of individual containers containing at least one peptide identified by 
the methods described herein. 
^ According to yet another aspect of the present invention there is 

provided a method of identifying peptides originating from cancer associated 
proteins and being capable of binding to MHC molecules of a particular 
haplotype, the method comprising obtaining a cancer cell type expressing a 
soluble and secreted form of the MHC molecules of the particular haplotype; 
collecting the soluble and secreted form of the MHC molecules of the 
particular haplotype; analyzing peptides bound to the soluble and secreted 
form of the MHC molecules of the particular haplotype; and identifying 
peptides originating from cancer associated proteins and being capable "'of 
binding to MHC molecules of the particular haplotype. 
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According to still another aspect of the present invention there is 
provided a method of identifying peptides originating from cells participating 
in the development and/or progression of an autoimmune disease and being 
capable of binding to MHC molecules of a particular haplotype, the method 
5 comprising obtaining cells participating in the development and/or progression 
of the autoimmune disease and expressing a soluble and secreted form of the 
MHC molecules of the particular haplotype; collecting the soluble and secreted 
form of the MHC molecules of the particular haplotype; analyzing peptides 
bound to the soluble and secreted form of the MHC molecules of the particular 
10 haplotype; and identifying peptides originating from proteins participating in 
the development and/or progression of the autoimmune disease and being 
capable of binding to MHC molecules of the particular haplotype. 

According to an additional aspect of the present invention there is 
provided a method of identifying peptides originating from virus infected cells 
and being capable of binding to MHC molecules of a particular haplotype, the 
method comprising obtaining virus infected cells expressing a soluble and 
secreted form of the MHC molecules of the particular haplotype collecting the 
soluble and secreted form of the MHC molecules of the particular haplotype; 
analyzing peptides bound to the soluble and secreted form of the MHC 
molecules of the particular haplotype; and identifying peptides originating 
from the virus and being capable of binding to MHC molecules of the 
particular haplotype. 
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According to yet an additional aspect of the present invention there is 
provided a method of identifying peptides originating from a particular cell 
type characterized by at least one of the following (i) cell over-expressing at 
least one protein; (ii) cells characterized by induced mutations; (iii) cells of 
metastases; (iv) normal or transformed cells expressing cell surface proteins, 
the peptides being capable of binding to MHC molecules of a particular 
haplotype, the method comprising obtaining cells of the particular cell type 
expressing a soluble and secreted form of the MHC molecules of the particular 
haplotype; collecting the soluble and secreted form of the MHC molecules of 
the particular haplotype; analyzing peptides bound to the soluble and secreted 
form of the MHC molecules of the particular haplotype; and identifying 
peptides originating from the particular cell type and being capable of binding 
to MHC molecules of the particular haplotype. 

According to still an additional aspect of the present invention there is 
provided an electronic data storage device, storing, in a retrievable form, a 
plurality of peptides being arranged at least according to their association with 
a pathology and further according to their ability of binding to MHC molecules 
of a particular haplotype. 

According to a further aspect of the present invention there is provided 
an electronic data storage device, storing, in a retrievable form, a plurality of 
peptides being arranged at least according to their association with a protein of 
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interest and further according to their ability of binding to MHC molecules of 
a particular haplotype. 

According to yet a further aspect of the present invention there is 
provided a method of eliciting an immune response against a protein of interest 
in a subject having a particular MHC haplotype, the method comprising 
determining the subject's particular MHC haplotype; and administering to the 
subject an effective amount of at lea$t one peptide derived from the protein of 
interest and which is capable of binding to MHC molecules of the particular 
haplotype. 

|S According to still a further aspect of the present invention there is 

|ll provided a method of treating a pathology by eliciting an immune response 

tii 

£ against a protein of interest in a subject having a particular MHC haplotype, 

IB 

^ the method comprising determining the subject's particular MHC haplotype; 

0 

5 and administering to the subject a therapeutic effective amount of at least one 

ly 

P e ptid e derived from the protein of interest and which is capable of binding to 
r MHC molecules of the particular haplotype. 

According to an additional aspect of the present invention, there is 
provided a method of eliciting an immune response against a protein of interest 
in a subject, the method comprising using an individualized in vitro assay for 
determining an immune reactivity of an immune system of the subject to a 
plurality of peptides derived from the protein of interest; and administering to 
the subject an effective amount of at least one peptide derived from the protein 
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of interest and which is capable of inducing predetermined sufficient immune 
reactivity. 

According to further features in preferred embodiments of the invention 
described below, administering to the subject the therapeutically effective 
amount of the at least one peptide is accompanied by presenting the at least 
one peptide in context of an antigen presenting cell. 

According to still an additional aspect of the present invention, there is 
provided a peptide selected from the group consisting of SEQ ID NOs;4-6, 
10-14, 19-21, 23-37, 44-88 3 90-141, 143-144, 146-173, 175-189 and 191-195, 
all of which were never reported to bind MHC molecules* 

According to still an additional aspect of the present invention, there is 

in 

-P provided a peptide selected from the group consisting of SEQ ID NOs: 5, 9, 10 

5WS, 



in 

"V* -. 



and 25. 

According to yet an additional aspect of the present invention, there is 
provided a peptide selected from the group consisting of SEQ ID NOs: 13, 20, 
23 and 24. 

According to another aspect of the present invention, there is provided a 
pharmaceutical composition comprising, as an active ingredient, at least one of 
the peptides described herein, and a pharmaceutical^ acceptable carrier. 
Preferably, the at least one of the peptides is presented in context of an antigen 
presenting cell. 
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According to further features in preferred embodiments of the invention 
described below, the peptide comprises at least one modification rendering 
peptides more stable in a body and/or more immunogenic. 

According to still further features in the described preferred 
5 embodiments the at least one modification is selected from the group 
consisting of peptoid modification, semipeptoid modification, cyclic peptide 
modification, N terminus modification, C terminus modification, peptide bond 
modification, backbone modification and residue modification. 

The present invention successfully addresses the shortcomings of the 
io presently known configurations by providing a novel method for the 



||1 identification of MHC bound peptides. 



BRIEF DESCRIPTION OF THE DRAWINGS 
g 15 The invention is herein described, by way of example only, with 

reference to the accompanying drawings, With specific reference now to the 
drawings in detail, it is stressed that the particulars shown are by way of 
example and for purposes of illustrative discussion of the preferred 
embodiments of the present invention only, and are presented in the cause of 
20 providing what is believed to be the most useful and readily understood 
description of the principles and conceptual aspects of the invention. In this 
regard, no attempt is made to show structural details of the invention in more 
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In the drawings; 

FIGs. l A-C demote tt,e purified „ f soluble MHC from oancer 
«U, S „, uble MHCs was purifie d by to™^ fam ^ ^ 
»*- „f 5xl0 > Klk ^ ^ ^ ^ ^ ^ 

SDS-PAOE and stained ^ CoolMssie ^ ^ 

mA-A2/Q I0 b fr„ m MCF-7 cells, (,B) ft.nficanon of sHLA-A2 from 
I l ° UCI - 101 Cells ' ^cation of sHLA-B7 from UCI-107 cells. 

FIGs. 2A-C show a typical nano-capillaiy reversed-phase 
chromatography of MHC bound peptides purified from soluble MHC from 
5X10 7 MCF-7 breast cancer cells. (2A) The total-ion-current chromatogram 
(TIC). (2B) Mass spectrum taken at the time point of 33.3 minutes. (2C) 
| 15 Spectrum of the collision-induced-disintegration (CID) of the dominant 
peptide in 2B having a m/z of 1028.5 that eluted at the 33.3 minutes. The 
putative MHC peptide GLIEKNIEL (SEQ ID NO:13) was identified to 
originate from DNA-methyl transferase. 

FIGs. 3A-B show a comparison of the chromatographs, the MS and the 
20 CID spectra of the synthetic peptide: GLIEKNIEL (SEQ ID NO: 13) of the 
DNA methyl transferase (3A) with those of the peptide m/z=1028.5 (SEQ ID 
NO: 13) from the breast cancer line MCF-7 (3B). 
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FIGs, 4A-D demonstrate the evaluation of the correctness of the 
MAGE-B2 peptide pl091 (GVYDGEEHSV, SEQ ID NO.20) by comparing 
the retention time and CID spedra of the synthetic peptide (4A) to that of the 
natural peptide m/z-1091.4 (SEQ ID NO:20) from the ovarian cancer line 
5 UCI- 1 07 (4B). (4C) Evaluation of the binding affinity of peptide p 1 09 1 (SEQ 
ID NO:20) to HLA-A2 by reconstituting it into cells surface empty MHC of 
the RMA-S-HHD cells as assayed by FACS analysis. (4D) The homology 
between this MAGE-B2 peptide, pi 091 (SEQ ID NO:20) to two other already 
p known HLA-A2 peptides derived from homologous region in MAGE-A4 

; in? 

g io GVYDGREHTV (SEQ ID NO:38) [27] and MAGE-A10 proteins 
01 

m GLYDGMEHL (SEQ ID NO:39) [28]. 

m 

■£ FIG. 5 shows an example of reconstitution of peptides into cells surface 

L MHC to test their binding and affinity as assayed by FACS analysis. Synthetic 

pjj peptides were added to 10 6 RMA-S-HHD cells to a concentration of 100 mM 

i 

p 15 followed by incubation for two hours at 26°C and two hours at 37°C. The 
stability of the peptides binding to the HHD cells was measured by indirect 
FACS assay after decoration for another hour with the W6/32 raAb at 4°C and 
30 minutes incubation with FITC goat anti-mouse Ab at 4°C. The HLA-A2.1 
peptide derived from gplOO served as a positive control and unloaded 
20 RMA-S-HHD cells as a negative control. 

FIG. 6 demonstrates a CTL assay with murine cells presenting human 
MHC (EL4-HHD). Cells were loaded separately with individual peptides, 
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washed and injected in four groups: 1- pl028 (SEQ ID NO: 13) alone, 2- 
pl258 (SEQ ID NO:24) alone, 3- pool of peptides: p913 (SEQ ID NO:5), p958 
(SEQ ID NO:9), p989 of CD59 (SEQ ID NO;ll) and p989 of FU (SEQ ID 
NO:12) s 4- peptides pl031 (SEQ ID NO: 14), pll21 (SEQ ID NO:22) and 
5 pl068 (SEQ ID NO:16). Unloaded EL4-HHD or targets cells not loaded with 
the peptides were used as negative controls. An effector-to-target ratio of 50: 1 
is shown. 
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10 DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invention is of a method of identifying peptides of a desired 
origin, such as tumor associated antigens, pathogen (e.g., virus, bacteria) 
derived antigens, endogenous cytokines, etc., which are capable of binding to 
J* \ MHC molecules of a particular haplotype. The present invention is further of 

ry 

% 15 peptides identified by the method and pharmaceutical compositions containing 

M 

the peptides. Still, the present invention is further of databases describing the 
peptides and the use of the peptides in vaccination to treat and/or prevent 
various pathologies, cancer and autoimmune diseases, in particular* 

The principles and operation of the present invention may be better 
20 understood with reference to the drawings and accompanying descriptions. 

Before explaining at least one embodiment of the invention in detail, it 
is to be understood that the invention is not limited in its application to the 
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details set forth in the following description or exemplified by the Examples. 
The invention is capable of other embodiments or of being practiced or carried 
out in various ways. Also, it is to be understood that the phraseology and 
terminology employed herein is for the purpose of description and should not 
be regarded as limiting. 

According to one aspect of the present invention there is provided a 
method of identifying peptides originating from a particular cell type and being 
capable of binding to MHC molecules of a particular haplotype* The method 
according to this aspect of the present invention is effected by obtaining a cell 
type expressing a soluble and secreted form of the MHC molecules of the 
particular haplotype; collecting the soluble and secreted form of the MHC 
molecules of the particular haplotype; and analyzing peptides bound to the 
soluble and secreted form of the MHC molecules of the particular haplotype, 
thereby identifying the peptides originating from the particular cell type and 
being capable of binding to MHC molecules of the particular haplotype. 

According to another aspect of the present invention there is provided a 
method of identifying peptides originating from cancer associated proteins and 
being capable of binding to MHC molecules of a particular haplotype. The 
method according to this aspect of the present invention is effected by 
obtaining a cancer cell type expressing a soluble and secreted form of the 
MHC molecules of the particular haplotype; collecting the soluble and secreted 
form of the MHC molecules of the particular haplotype; analyzing peptides 
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bound to the soluble and secreted form of the MHC molecules of the particular 
haplotype; and identifying peptides originating from cancer associated proteins 
and being capable of binding to MHC molecules of the particular haplotype. 

According to still another aspect of the present invention there is 
provided a method of identifying peptides originating from cells participating 
in the development and/or progression of an autoimmune disease and being 
capable of binding to MHC molecules of a particular haplotype. The method 
according to this aspect of the present invention is effected by obtaining cells 
p participating in the development and/or progression of the autoimmune disease 

05 an.d expressing a soluble and secreted form of the MHC molecules of the 

Si 

111 particular haplotype; collecting the soluble and secreted form of the MHC 

111 

42 molecules of the particular haplotype; analyzing peptides bound to the soluble 

t and secreted form of the MHC molecules of the particular haplotype; and 

I! ; identifying peptides originating from proteins participating in the development 

)5 and/or progression of the autoimmune disease and being capable of binding to 

^ MHC molecules of the particular haplotype. 

According to another aspect of the present invention there is provided a 
method of identifying peptides originating from virus infected cells and being 
capable of binding to MHC molecules of a particular haplotype. The method 
according to this aspect of the present invention is effected by obtaining virus 
infected cells expressing a soluble and secreted form of the MHC molecules of 
the particular haplotype; collecting the soluble and secreted form of the MHC 



20:52 FROM: 



TO: 972 9 7706333 



20 

molecules of the particular haplotype; analyzing peptides bound to the soluble 
and secreted form of the MHC molecules of the particular haplotype; and 
identifying peptides originating from the virus and being capable of binding to 
MHC molecules of the particular haplotype. 

According to still another aspect of the present invention there is 
provided a method of identifying peptides originating from a particular cell 
type characterized by at least one of the following (i) cell over-expressing at 
least one protein; (ii) cells characterized by induced mutations; (iii) cells of 
metastases; (iv) normal or transformed cells expressing cell surface proteins, 
the peptides being capable of binding to MHC molecules of a particular 
haplotype. The method according to this aspect of the present invention is 
effected by obtaining cells of the particular cell type expressing a soluble and 
secreted form of the MHC molecules of the particular haplotype; collecting the 
soluble and secreted form of the MHC molecules of the particular haplotype; 
analyzing peptides bound to the soluble and secreted form of the MHC 
molecules of the particular haplotype; and identifying peptides originating 
from the particular cell type and being capable of binding to MHC molecules 
of the particular haplotype. 

In general, the present invention provides a method of identifying 
peptides originating from at least one protein of interest or an unknown protein 
and being capable of binding to MHC molecules of a particular haplotype. 
The method is effected by obtaining cells co-expressing the at least one protein 
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of interest or unknown protein and a soluble and secreted form of the MHC 
molecules of the particular haplotype; collecting the soluble and secreted form 
of the MHC molecul es of the particular haplotype; analyzing peptides bound to 
the soluble and secreted form of the MHC molecules of the particular 
haplotype; and identifying peptides originating from the at least one protein of 
interest or unknown protein and being capable of binding to MHC molecules 
of the particular haplotype. Depending to a great extent on the cell type 
employed, it will Once a peptide of an unknown protein is identified, this 
p protein becomes a protein of interest. 

\&$ 

0(1 The protein of interest or unknown protein can be a native protein 

CI 

expressed by the cells, or the protein of interest can be expressed by the cells 

VI 

^ following transformation of the cells with nucleic acid encoding for th e protein 

1^ of interest using techniques well known in the art. 

flj The method of the present invention can thus be used to associate 

D proteins of yet unknown pattern of expression with particular tissues or cell 

types, such as cancer cells. In addition, the method of the present invention 
can be used to determine whether a specific open reading frame (ORF) is 
expressed or not in certain cells. 

In one preferred embodiment of the present invention the cell type is a 
cancer cell or a cancer cell line. Primary cell lines, metastatic cell lines, tumor 
cell lines and normal cell lines which are suitable for implementing the method 
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of the present invention are available, for example, from ATCC. Tables 1 and 
2 below provide examples: 



TABLE 1 
Primary and metastatic cell lines 



Primary Call Line 


Metastatic Cell Line 


ATCC 
No. 


Name 


Disease 


Tissue 


ATCC 
No. 


Name 


Tissue 


CCU228 


SW48Q 


colorectal 
adenocarcinoma 


colon 


CCL-227 


SW620 


lymph 
node 


CRL-1864 


RM 


gastric adenocarcinoma 


stomach 


CRL-1863 


RF-48 


66 Cites 


CRL-1675 


WM-115 


melanoma 


skin 


CRL-1676 


WM-266-4 


n/a 


CRL-7425 


Hs 688(A).T 


melanoma 


akin 


CRl-7425 


H$ 6$a($)T 


lymph 
node 



TABLE 2 
Tumor and normal cell lines 



SO 


Tumor Cell Une 


Normal Cell Line 


ffi 


ATCC 


Name 


Cancor 


Tlssuo 


ATCC 


Name 


Tlssuo 


'i n 


No. 




Type 


Source 


NO. 




Source 


m 


CCL-258 


NCI-H2126 


carcinoma: non-small 


lung 


CCL-256.1 


NCI-BL2126 


peripheral 








cell (una. cancer 








blood 


63 


CRL-5B68 


NCWH1395 


adenocarcinoma 


lung 


CRL-5957 


NCI-BL1395 


peripheral 
blood 


& 


CRL-5872 


NCI-H1437 


adenocarcinoma 


lung 


CRL-5958 


NCI-8L1437 


peripheral 
















blood 


sH 

111 


CRL-5882 


NCI-H1848 


adenocarcinoma 


lung 


CRL-S9S4 


NCM3L1648 


peripheral 
blood 




CRL-S911 


NCI-H2009 


adenocarcinoma 


lung 


CRL-5981 


IMCI-Bl2u09 


peripheral 


□ 














blood 


CRL-5985 


NCI-H2122 


adenocarcinoma 


pleural effusion 


CRL-5S67 


NCI-BL2122 


peripheral 
blood 




CRL-5922 


NCI-H2087 


adenocarcinoma 


lymph node 
(metastasis) 


CRL-5985 


NCI-BL2087 


peripheral 
blood 




CRL-5888 


NCI-H1672 


carcinoma; classic 
small cell lunq cancer 


lung 


CRL-5959 


NCI-BL1672 


peripheral 
blood 




CRL-5929 


NCI-H2171 


carcinoma: small cell 
lunn cancer 


lung 


CRL-S969 


NCI-BL2171 


peripheral 
blood 




CRL-5931 


NCI-H219B 


carcinoma; small cell 
lung cancer 


lung 


CRL-5956 


NCI-0L219S 


peripheral 
blood 




CRL-5858 


NCI-H1184 


carcinoma; small can 
lung cancer 


lymph node 
(metastasis) 


CRL-5949 


NCI-BL11B4 


peripheral 
blood 




HTB-172 


NCI-H209 


carcinoma; small cell 
lung cancar 


bone marrow 
(metastasis) 


CRL-5948 


NCI-BL2Q9 


peripheral 
blood 




CRL-5983 


NCI-H2107 


carcinoma; email cell 
lung cancer 


bone marrow 
(metastasis) 


CRL-S966 


NCI-BL2107 


peripheral 
blood 




MT&-120 


NCI-H128 


carcinoma: small cell 
lunfl cancar 


pleural effusion 


CRL-S947 


NCI-BL128 


peripheral 
blood 




CRL-5915 


NCI-H2D52 


mesothelioma 


pleural effusion 


CRL-S983 


NCI-BL2062 


peripheral 
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*0 



5rw 



1^1 



D 















diooo 


CRL-5893 


NCI-H1770 


neuroendocrine 
caret noma 


lymph node 
(metastasis) 


CRU5960 


NCI-BL1770 


peripheral 
blood 


HTB-126 


Hs S7BT 


ductal carcinoma 


mammary 
gland; breast 


HTB-125 


HS 5760SI 


mammary 

gland; 

breast 






uuctpi carcmome 


mammary 
gland; breast 




urn nm 
BL 


blood 


r>Ri -9V\A 


HbU 1334 


QUuiJi cd ranDmd 


rTiaiTirnary 

Aland; breast 




1 v ^* 


blood 


PR1 -9**1il 




primary uuciaj 

carcinoma 


mammary 

gtand; breast 


CRl -944ft 

lynL'fiJ'rD 


UCC38 BL 


nprinhpral 

blood 




n*/ 1 *! 14 J 


primary ductal 
carcinoma 


mammary 
aland: breast 


not 


BL 


norinharnl 

pel ipi icrni 

blood 


r*Di o*ao9 


U/**t*M "tfl7 


primary ductal 
carcinoma 


mammary 
gland: breast 


r*Ri .919^ 


BL 


blood 


LKL-^4i£4 




primary ductal 
carcinoma 


mammary 
gland; breast 




BL 


poripfreini 

blood 






pnmary auciai 
carcinoma 


nidrTirri tiry 

gland; breast 




nUw I 039 

BL 


ptal l|Ji ralal 

blood 


^Dl HOC 

CRL-2336 


nC019d7 


primary ductal 
carcinoma 


mammary 
gland: breast 






nannharsl 

pcf ipnciai 

blood 


CRL-2340 


HCC2157 


primary ductal 
carcinoma 


mammary 
gland; breast 


UKL-Z341 


BL 


peripheral 
blood 


CRL-2343 


HCC221 8 


primary ductal 
carcinoma 


mammary 
gland: breast 


CRL-23oo 


BL 


peripheral 
blood 


CRL-7345 


Hs 574.T 


ductal carcinoma 


mammary 
gland: breast 


CRL-73W 


i_i 0 err a cl 
MB 0f4.oK 


skin 


^ftl "7,4 0*1 


Hs 742 ,T 


scirrhous 
adenocarcinoma 


mammary 
gland; breast 


f*Rl _7Afl1 


rip r4jC a 9r 


skin 


LfKL"r J I) J 




cancer 


mdmrndiy 

gland; breast 


ur\L r JUl 


ri0 tiTUiwh 


skin 


OKL-74ob 


Ub "/A Q T 


cancer 


mammary 








CRL-7365 


H3 605.T 


carcinoma 


mammflfV 

iiiaiiiiiiaiy 

Inland; breast 


CRL-7364 


Hs 605 Sk 


skin 


CRL-7368 


H3 606 


carcinoma 


mammary 
gland; breast 


CRL-7367 


H8 606.Sk 


skin 


CRL-1974 


COLO 829 


malignant melanoma 


skin 


CRL-1 980 


COLO 
628BL 


peripheral 
blood 


CRL-7762 


TE 354 T 


basal cell carcinoma 


skin 


CRL-77B1 


TE 353,Sk 


skin 






skin 


CRL-7669 


Hs 939.Sk 


skin 


CRL-7360 


Hs BOOT 


melanoma 


skin 


CRL-73S9 


Hs 6Q0.Sk 


skin 


CRL-7677 


Hs 925.T 


oaqetoid sarcoma 


skin 


CRL-7676 


Hs 925.Sk 


skin 


CRL-7672 


Hs9l9T 


benign osteoid 
osteoma 


bone 


COl -7G71 

UKL a /ar 1 


Ha Q1Q c;k 




CRL-7554 


Hs 821. T 


plant cell sarcoma 


bone 


rHL-7553 


Hs B21 Sk 


skin 


CRL-7552 


HS 820.T 


heterophil^ 
oateofi cation 


bona 


CRL-75S1 


Hs 820,Sk 


skin 


CRL-7444 


Ha 704.T 


osteosarcoma 


bone 


CRL-7443 


He 704.Sk 


skin 


CRL-7448 


Hs 707{A)T 


osteosarcoma 


bone 


CRL-7449 


Hfi 

707(B).£p 


skin 


CRL-7471 


Hs 735.T 


osteosarcoma 


bone 


CRL-7865 


HS 735,$k 


fiklft 


CRL-7571 


H6 836.T 


osteosarcoma 


bone 


CRL-7570 


Hs 836.5k 


skin 


CRL-7595 


Ha 860.T 


osteosarcoma 


bone 


CftL-7519 


Hs 791. Sk 


skin 
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CRL-7622 


Hs B88.T 


osteoaaredma 


bone 


CCL-211 


Hsaaau 


lung 


CRL-7626 


He 889.T 


osteosarcoma 


bone 


CRL-7625 


Hs 689.Sk 


skin 


CRL-7628 


Hs 6S0.T 


osteosarcoma 


bone 


CRL-7627 


Hfi 690.Sk 


sktn 


CRL-7453 


He 709.T 


periostitis; granuloma 


bone 


CRL-7452 


Hs 709.Sk 


skin 


CRL-7432 


H8 696J 


adenocarcinoma 


unknown 


CRL-7431 


He 696.$k 


skin 


CRL-76&6 


Hs 789T 


traditional cell 
carcinoma 


ureter 


CRL-7518 


Hs 769.Sk 


skin 


CRL-7547 


H$ 614.T 


giant cell sarcoma 


vsrtebral 
column 


CRL-7546 


Hs 8l4.Sk 


skin 



In another preferred embodiment of the invention, the cell type is a 
virus infected cell or cell line. Table 3 below provides examples of some 
known viruses and the diseases they cause; 

TABLE 3 



Diseases 


Viruses and other pathogens 


African sleeping sickness (African 
trypanosomiasis) 


Trypanosoma brucei 


ATPkC 
J\\\)& 


nlv 


Amebiasis 


Entamoeba histolytica 


BSE ("mad cow disease") and nvOD 




Campylobacter infections 


Campylobacter 


Chagas' disease (American trypanosomiasis) 


Trypanosoma cruzi 




viurjo cnyjciac 


Coccidioidomycosis 


Coccidioides immitis 


Cryptosporidiosis 


Cryptosporidium 


CycJosporiasis 


Cyclospora 


Dengue fever 


Dengue viruses 


Diphtheria, tetanus, and pertussis 


Toxin-producing strains of Corynebacterium 
diphtheriae 


Bordetclla pertussis 




Encephalitis 


Japanese encephalitis virus Tickborae 
encephalitis West Nile vims 


Fttariasis 


Wucbereria bancrofti and Brugia malayi 


Giardiasis (Giardia infection) 


Giardia intestinalis 


Hantavirus pulmonary syndrome 


Hantavirus 


Hepatitis 


Hepatitis viruses A, B ( C, E 


Histoplasmosis 


Histoplasma capsulaturn 


Influenza (flu) 




Leishmaniasis 


Leishmania 


Leptospirosis 


Leptospira 


Lyme disease 


B. burgdorferi sensu stricto, B. afzelii, or B. 
garinii 


Malaria 


Plasmodium falciparum P. vivax P, ovale and P. 
malariac 


Measles, mumps, and rubella (MMR) 




Meningitis 


Haemophilus influenzae type b Streptococcus 
pneumoniae and Neisseria meningitidis 


Onchocerciasis (rivCT blindness) 


Onchocerca volvulus 


Plague 


Yersinia pestis 
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roiiornycJiiis 




Kaoies 


, — — — 

Rhabdovindae, genus uyssavirus 


Rocky Mountain spotted fever 






nckettsi] 


severe di&rrhea 


jvuia virus 


Salmonellosis 


Salmonella 


Schistosomiasis 




Shiffellft<:i<: 


Shigella 


Tuberculosis (TB) 


Mycobacterium tuberculosis 


Typhoid fever 


Salmonella serogroup Typhi 


Typhus fevers 


Tickettsiae 


chickenpox 


Varicella 


Vibrio parahaemolyticus 




Viral hemorrhagic fevers (e.g., Ebola, Lassa, 
Marburg, Rift Valley). 


arenaviruses, filov trusts, bunyaviruses, and 
flaviviruses 


Yellow fever 





IB 
Si 

in 

s 

q 

in 



In yet another preferred embodiment of the present invention, the cell 
type is a cell involved in a development and/or progression of an autoimmune 
diseases such as T or B cells, and/or an allergic disease or condition, such as 
mast cells. 

In one example, the at least one protein of interest is a tumor associated 
antigen. The tumor associated antigen can be natively expressed by the cells 
or can be expressed by appropriately transformed cells. Table 4 below lists 
some known genes encoding proteins which were identified as tumor 
associated antigens. 



Gene Symbol 


Gene Name 




Disorders 










AfiLt 


v-abf Ahelson murine leukemia viral 
oncoaene homoloa 1 


9q34.1 


Leukemia, chronic myeloid 


ABU 


v-abl Abelson murine leukemic viral 
oncogene homolog 2 (ar& Abel son-rotated 
cena) 


Iq24<i25 


Leukiemia, acute myeloid, with eosinophil ta 


AK.T2 


v-alct murine thymoma viral oncogene 
homolOF.2 


]$q!3.i-ql3,2 


Ovarian carcinoma 


ARH1 
ARP 


ras homulofl pen© family, member 1 


l£31 

3p21.1 


Ovarian , _ 
Pantffefctic cancer 


AXTN2 


axin 2 (conducttn, axil) 


1?a23-q24 _ 


Colorectal cancer 
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HAY 


O^L^SSSUUmtCU A protein 






Colorectal ttinccr 

T-j»i»lt arnti* iunmhnhlitatir fDnL'Mmin 


BCPR 








BRCA1 




17q2l 


Dv&rian center 
Broast-Ovunan cancer 


BRCA2 


^rPiKt fiari cp r 2 earlv an<£r 


I3q12,3 


3 reus t cancer 2, early onset 
Pancreatic cancer 


BRCA3 




Ilq23 


Breast cancer-3 


BRCA4 






^rmiit c-ftftfit+r tuna 4 


BRCAX 






Breast cancer, type 4 






13 


srcfLSL cHiiucr. aucmi 


BRCD2 




Ip36 


Breast cancer, ductal 


BUB1 


buddine uninhibited by benzi imidazoles 1 
'yeast honioloft) 


2ql4 


Colorectal cancer with chromosomal 

nsiu unity 


CDH1 


cadherln 1, typo I. E^tidhcrui (epithelial) 


16q22.1 


Endometrial earcinomft 
Ovarian carcinoma 




CLD 


congenita! chloride diarrhea 




lajiuh cancer 

Chloride diarrhea, congenital. Finnish type* 


CSF1R 


• 

colony stimulating factor 1 receptor, rortntrly 
McDonough feline sarcoma vir.il (v-f'ms) 

nnmdAnti* hrtmoloc 


5a33 2-o33 3 


Myeloid malignancy, predisposition to 


CTNNBJ 


tatcnin (cadhefin-oSHOciated protein), beta t 


3p22-p21.3 


Colorectal cancer 

Hepatoblastoma 

Pilomatricoma 


CYLD 




16al2-ol3 


Cylindromtitosia, familial 


DCC 


deleted in colorectal carcinoma 


I8q21,3 


Colorectal cancer 


D£K 


DEK oncogene (DNA binding^ 


6d23 


Leukemia, acute ndn lymphocytic 


DUaCl 


deleted in lung and esophagi' cancer 1 




Lung cancer 

E?ophftKeal cancer 


DMBT1 


deleted in malignant brain tumors 1 


lOn^S ^>a2£ 1 


Glioblastoma multiforme 
Medullobl&stoma 


DRA 


down-regulated in adenoma 


7q22-q31.1 


Colon cancer 

Chloride diarrhea, congenital. Finntah typo, 


ELAC2 


elaC (£■ coli) homolog 2 


17p 


Prostate cancer, susceptibility to 


EP300 


El A binding protein n300 


22ot3 




ESR1 


estrogen receptor 1 


6q25,l 


Breast cancer 


ETV6 




12pl3 


Leukemia, acuto lymphoblastic 


FSHR 


follicle stimulating hormone receptor 


2p2l-pl6 


Premature ovarian failure 


HNPCC7 


3546 


15q21.1 


Colorectal cancer, hereditary non polyposis* 
tvoe7 — 


HPCI 


hereditary prostate canter l 


1024-0,25 


Prostate cancer, ausceptibilitv to 




hereditary prostate cancer, X-linked 


Xq2?Ki2a 


Prostate cancer, susceptibility to 


KPCX 
HRAS 


v-Hfl-ras Harvey rat sarcoma viral onco&ene 
homolog 


11 pi 5.5 


Bladder cancer 


HRPT2 


hyperpiiruthyroidism 2 (with jaw tumor) 


Iq25-q31 


Hypcrparathyroidism-jaw tumor syndrome 


KAJl 


kangai 1 (suppression of tumorigenicity 6. 
prostate: CM2 ftnti&eo (R2 leukocyte 
antigen, antigen detected by monoclonal and 


Llp1K2 


Prostate cancer, susceptibility to 
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5 r3 



JJWfS. 



h 



KIT 


v-kit Jffardy-Zuckcrman 4 feline iurtoma viral 
oncogene homolog 


4qJ2 


PietjaJdism 
Mast cell leukemia 
Mastocytosis with associated 


KRAS1P 


v-Ki-raal Kirsten mc sarcoma 1 viral 
oncogene homolog, processed oseudogene 


I2pl2.l 


Colorectal adenoma 
Colorectal cunccr 


KRAS2 


v-K.i-ros2 Kirsten rat sarcoma 2 viral 
oncoRene homolog 


12pl2.1 


Colorectal adenoma 
Colorectal cancer 


LCFS2 


mitochondrial ribosnmal protein LI 3 


I8ql l-ql2 


? Lynch cancer family syndrome TT 


ICO 


liver cancer oncogene 


2q 14-0.21 


Hepatocellular carcinoma 


MADH4 


MAD (mother ugainst dccapentaplegic. 
Droaophila) homo) or 4 


]8q2u 


Pancreatic cancer 
Polyposis, juvenile intestinal 


MCC 


mutated in colorectal cancers 


Sq2\ 


Colorectal cancer 


MERTK 


c-mcr protooncoRene tyrosine Icinti&e 


2ql4,l 


Retinitis pigmentosum MERTK- related 


MET 


met proto-oncogene (hepatocyte growth 
factor receptor) 


7q3l 


Renal cell carcinoma, papillary, familial and 
sporadic 


MGCT 




I2q22 


Male germ cell tumor 


MLHL 


mutL (£. coli) hamolog 1 (colon cancer. 
nOnpolyposis type 2) 


3p2l 3 


Colorectal cancer, hereditary non polyposis, 
W* 2 


MPL 


myeloproliferative leukemia virus oncogene 


Ip34 


Thrombocytopenia, congenital 
amegakaryocytic 


MSH2 


mutS (E. coli) homolog 2 (colon Cuncer, 
nonpolyposis type 1) 


2p22-p21 


Colorectal cancer, hereditary nonporyposis, 
^ ] 


MSH6 


mutS (£. coli) homolog 6 


2pl6 


Cancer susceptibility 
Endometrial carcinoma 
Colorectal 


MTACRI 


multiple tumor-a&£Oci(LEed chromosome 
region 1 


MplS.5 


Wilmi tumor, type 2 

Adrenocortical carcinoma, hereditary. 202300 


myc 


v-myc avian myelocytomotosui viral 
oncogene homolog 


8q24, 12^24, 13 


Burkitt lymphoma 


NRAS 


neuroblastoma RAS viral (v-ras) oncogene 
hnmclos 


1pl.V2 


Colorectal cancer 


PCAP 


predisposing tor prostate cancer 


Iq42.2-q43 


Prostate cancer, susceptibility to 


PCfiC 


3475 


lp36 


Prostate cancer, susceptibility lo 


POOfB 


platelet-derived growth factor beta 
polypeptide (simian sarcoma viral (v-sk) 
Oncogene homolog) 


22ql2.3»q13.1 


Meningioma, SlS-rclated 
Dermatofibrosarcoma protuberans 


P0GFRL 


platelet-derived growth factor receptor-like 


8p22-p21.3 


Hepatocellular cancer 
Colorectal cancer 


PGU 


paraganglioma or familial glomus tumors 2 


Ilql3.t 


Paraganglioma, familial nonchromaffin 


PCL3 


paraganglioma or familial glomus tumors 3 


la.21 


. .Paragangliomas, familial nonchromaffin^ 3 


PHB 


prohihitin 


17q21 


Breast cancer, sporadic 


PIK3CA 


phoaphoinoaitidc-3 -kinase, catalytic, alpha 
polypeptide 


3q26.3 


Ovarian cunccr 


PMS1 


postmeiotie segregation increased (5. 
cerevisiae) 1 


2q31-q33 


Colorectal cancer, hereditary nonpolyposis, 

fypc3 . 


PMS2 


postmeiotic segregation increased (5, 
cerevisiac) 2 


7p22 


Turcot syndrome with glioblastoma 
Colorectal cancer, 


PPP2R1B 


protein phosphatase 2 (formerly 2 A), 
regulatory subunit A (PR 65), beta isoform 


1 1 q22-q24 


Lung cancer 


PRCAI 


prostate cuncer 1 


Iq24-q25 


PrOStute cancer, susceptibility to 


PRKCA 


protein kinusc C t alpha 


I7q22-q23 2 


Pituitary tumor, invasive 


PTEN 


phosphatase and tens in homolog (mutated in 
multiple advanced cancers 1) 


t0q23.3 


Cowden disease 
Lhermitte-Duelos syndrome 


PTPNI2 


protein tyrosine phosphatase, non-receptor 
type 12 


7qll,23 


Colon cancer 
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(Ad27A, IDCn\w" ka* oncogene [umuy 


Sa21 < 


SrisceMi syndrome _ 


RAD5I 


ELAD51 (S. cerevisiae) homolog (E coli RccA 
homolog) 


150,15 1 


Breast cancer, susceptibility to 


RAD54L 


(5,cerevisiBG)-uice 


Ip32 


Lymphoma, non-Hodgkin 

Breast cancer, invasive intraductal 


RB1 


FA^rtAklnrfnmh 1 /in/rli til 1 nil ACfAACArf^ftH^fl I 

CtlnODiasxoma i ^nciuuing u&icuwuwuiihi/ 


1 3al4, 1 -q14.2 


Retinoblastoma 
Osteosarcoma 
Bladder cancer. 


RET 


ret protcK>ncogene (multiple endocrine 

nMnlnolfi tinA mfHiilliHlV fhWAtd CarCitlOflia 1. 
iGOujQJjlU UllU IlJwUUIIuljr Mljrlviu vulvlHVHPt* ■] 

Hirschsprung disease) 


IO4H 2 


Multiple endocrine neoplasia IIA 
Medullary thyroid 


R.UNX1 


mnt.ml'itjad tmvtcrintion factor 1 f acute 
myeloid leukemia 1; umM oncoaene) 


21q22.3 


Leukemia, acute myeloid 
Platelet disorder, familial, with 


SCLC1 


354 


3o23-d21 


Small-cell cancer of lung 


SLC22A1L 


solute carrier family 22 (organic cation 
transporter), member 1-1 ike 


1 1 pi ^ 5 


Breast cancer 

Rhabdomyosarcoma 

Luna 


SLC26A3 


•i k. * " 
solute Currier family 26 , member 3 


7a22-a31 I 


Colon cancer 

Chloride diarrhea, congenital, Finnish type 




dependent regulator of chromatin, subfiimiry 
b, member 1 


22q1 1 


Rhabdoid tumors 

Rhabdoid predisposition syndrome, familial 


SRC 


v-src avian sarcoma (SchmidfrRuppin A -2) 
viral oncogene homolog 


20q]2^|3 


Colon cancer, advanced 


SSTR2 


somatostatin receptor 2 


17q24 


Lung cuncer, small cell 


STU 




3d2S 


Pancreatic endocrine tumors 


ST12 


Suppression of tumori gen i city 12 (prostate) 


lOpter-qll 


Prostate adenocarcinoma 


o>r>* 


lunnre^iinn of tumnrittcnicitv 3 


1U13 


Cervical carcinoma 


ST* 


suppression of tumorigeniciry 8 (ovarian) 


6q26-q27 


Ovarian cancer, serous 


TACSTD2 


tumor-associated calcium signal transducer 2 


lD32-ql2 


Corneal dystrophy, fcelatintiuu drop-like 


TCP7L2 


trariSCnpllOn lllCior MIKc A \\. m xX\\ bpeuiiw, 
HMO-box) 


tOo.25.3 


Colorectal cancer 




vransiormtrtD gruwin iscwr, pum rwsjnw \\ 
(70-oOkD) 


3p22 


Colon cancer 

Colorectal cancer, hereditary nonpolyposis! 
type 6 


THPO 


thrombopoietin (myeloproliferative leukemia 

vim« nnfrtWnP liffinri mfcPlllrtlfVOCVte PfOWth 

and development factor) 


3q26.3-q27 


Thrombpcythemia, essential 


tviddCcm fin 


[Limor necroaia iucuji ici#cpiui aui^iiuiimjr. 

member 'Ob 


8p22-p21 


Squamous cell carcinoma* head and neck 


INrKir 1 J A 


[UITiOr ncCrvala lULlUi rcucpwr au^imiiiMjr, 


1 gq22. 1 


Osteolysis, familial expansile 
Pu^et disease of bone, 


INfRarl a 


tumor nocnrbib uiwuir rcucpiui aupcipnumy, 
member I A 


I2p132 


Periodic fever, familial 


TNFRSF6 


tumor necrosis factor receptor fujperfamily, 
rnciTiDGr o 


10q24.l 


Autoimmune lymp ho proliferative syndrome 


TNFSF5 


tumor necrosis factor (ligund) superfomily, 
member 5 (hyper-TgM syndrome) 


Xo.26 


Immunodeficiency* X-linked, with hypcr-lgM 


TNFSF6 


tumor necrosis factor (li&cmd) superfamily, 
member 6 


1q23 


Systemic lupus erythematosus. Susceptibility 

to 


TNF 


tumor necrosis factor {TNF superfamily, 
member 2) 


6p21.3 


Malaria, cerebral, susceptibility to 
Septic shock 


IOC 


tylosis with oesophageal cancer 


I7q24 


Tylosis with esophageal Ounce r 


TP53 


tumor protein p53 (Li-f remmeni syndrome) 


17p»3.l 


Colorectal cancer 
Li-Fraumeni syndrome 
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TP73 


tumor protein p?3 


1 D 36 


Neuroblastoma 


TSGlOl 


tumor susceptibility scne 101 


Up!5.2-p]5.1 


Breast cancer 


VMGLOM 


venous mnl formation with ftlomus cells 


Ip22-p2l 


Glomus tumors, multiple 


WTI 


Wilms tumor 1 


1lp13 


Wilms tumor, type 1 
Denys-Drash syndrome 
Prosier 


WT2 


Wilms tumor 2 


Mpt5.S 


Wilms tumor, type 2 
Adrenocortical carcinoma, hereditary 



In another example, the at least one protein of interest includes a 



cytokine. Many diseases, including neurodegenerative (e.g., Alzheimer's 
disease) and autoimmune (e.g., rheumatoid arthritis, multiple sclerosis and the 
like) diseases are caused or accompanied by inflammation, resulting in 
S infiltration of leukocytes into the inflicted tissue(s). In these diseases 
p proinflammatory cytokines and chemokines are believed to play a pivotal role 

W in the attraction of leukocytes to the site of inflammation and in the initiation 

HI 

jpl and progression of the inflammatory process. In rheumatoid arthritis, for 

J example, the role of proinflammatory cytokines, particularly TNF-o and IL-1, 

q 10 in disease manifestation has been intensively studied and explored in 

jy experimental models that have been expanded to clinical trials, Other 

O cytokines such as IL-4, TGF-p, IL-8, IL-17, IL-10, IL-11, IL-12 and IL-15 

have also been implicated in the regulation of rheumatoid arthritis. Such 

regulation can be attributed to either their direct effect on disease 
15 manifestation, their synergistic effect with other proinflammatory 

cytokines/chemokines, or their involvement in the regulation of chemokine 

transcription, and production. 

Chemokines are chemoattractant cytokines that mediate leukocyte 

attraction and recruitment at the site of inflammation. Based on the positions 
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of the first two cysteines, chemokines can be divided into four highly 
conserved but distinct supergene families, C-C, C-X-C, C and C-X3-C. The 
C-C family is primarily involved in the activation of endothelium and 
chemoattraction of T cells and monocytes to the site of inflammation. The 
5 protective competence of anti-C-C chemokine based immunotherapy has been 
demonstrated in experimental autoimmune encephalomyelitis (EAE), and 
rheumatoid arthritis. 

Neutralizing the activity of chemokines as a way to treat various 
diseases has been explored by many researchers. For example, in a recent 

C5 10 study neutralizing antibodies to epithelial neutrophil activating peptide 78 

tn 

Hj (ENA-78) were found capable of inhibiting the development of AA if 

til 

|T administered before but not after the onset of disease [92], In another recent 

q study, Barnes et al [93] used anti-human RANTES to ameliorate AA in the 

\n 

fy Lewis rat. Gong et al [94] used an antagonist of Monocyte Chemoattractant 

p 15 Protein 1 (MCP-1) to inhibit arthritis in the MRL-lpr mouse model. Using a 
streptococcal cell wall induced arthritis model it has been shown that anti-EL-4 
and anti MCP-1 antibodies block the disease [95]. The same study 
demonstrated that neutralizing IL-4 by itself, leads to a marked reduction in 
MCP-1 mRNA transcription at the autoimmune site and to inhibition of the 
20 development of disease which further implicates MCP-1 in playing an active 
role in arthritis development, 
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In yet another example, the at least one protein of interest includes a 
protein, e.g., a surface protein, of a pathogen, such as a viral pathogen, a 
bacteria] pathogen or a parasite (either mono or multicellular parasite). 

The major histocompatibility complex (MHC) is a complex of antigens 
encoded by a group of linked loci, which are collectively termed H-2 in the 
mouse and HLA in humans. The two principal classes of the MHC antigens, 
class I and class II, each comprise a set of cell surface glycoproteins which 

5 play a role in determining tissue type and transplant compatibility. In 
transplantation reactions, cytotoxic T-cells (CTLs) respond mainly against 
foreign class I glycoproteins, while helper T-cells respond mainly against 
foreign class II glycoproteins. 

Major histocompatibility complex (MHC) class I molecules are 

10 expressed on the surface of nearly all cells. These molecules function in 
presenting peptides which are mainly derived from endogenously synthesized 
proteins to CD8+ T cells via an interaction with the op T-cell receptor. The 
class I MHC molecule is a heterodimer composed of a 46-kDa heavy chain 
which is non-covalently associated with the 12-kDa light chain {5-2 

\5 microglobulin. Class I MHC-restricted peptides, which are traditionally 
assumed to be 8-10-amino acid-long, bind to the heavy chain ol-o2 groove via 
two or three anchor residues that interact with corresponding binding pockets 
in the MHC molecule. The p-2 microglobulin chain plays an important role in 
MHC class I intracellular transport, peptide binding, and conformational 
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stability [76]. For roost class I molecules, the formation of a heterodimer 
consisting of the MHC class I heavy chain, peptide (self or antigenic) and |3-2 
microglobulin is required for biosynthetic maturation and cell-surface 
expression [76], 

5 Research studies performed on peptide binding to class I MHC 

molecules enable to define specific MHC motifs functional in displaying 
peptides derived from viral or tumor antigens that are potentially immunogenic 
and might elicit specific response from cytotoxic T lymphocytes (CTLs) 

m 10 Soluble MHC multimers posses a high avidity for T-cells since they 

II provide multi-point binding of TCRs with their MHC-peptide ligands. As 

such, multimeric forms (tetramers) of MHC-peptide complexes have been the 
center of much interest recently, because they can be used for direct 
phenotypic characterization of T cell responses in normal as well as 
is pathological conditions, thus, providing insight into the pathopysiology and 
mechanisms of various diseases. Recombinant soluble and secreted MHC 
class I and class II complexes including single chain MHC are described in 
[79-91] which are incorporated herein by references. 

There are several thousands of MHC genes, some of which were 
20 cloned. Table 5 below associates the MHC genes into classes and types (6). 
The sequences of the known MHC genes can be found in the Kabat database 
ChttD://immuno.bme,nwu.eduA> . 
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TABLES 





Type 


Number of genes 


MHC Class I 


A, B,C 


1014 


MHC class II A chain 


DRDQDP 


348 


MHC class II B chain 


DRDODP 


1680 



Genes encoding MHC of particular haplotypes can be readily isolated 
5 using techniques well known in the art and reconstructed to encode soluble 
MHC molecules essentially as exemplified in the Examples section that 
follows. Such well known techniques include, for example, PCR 
amplification, enzymatic digestion and ligation. 
m According to a presently preferred embodiment of the present invention 

gj analyzing the peptides bound to the soluble and secreted form of the MHC 

?^ 

HI molecules of the particular haplotype is by mass spectrometry, mass charge 

in 

»S ratio and collision induced disintegration. Edman degradation can also be 

* employed in certain cases where a sufficient amount of the pure peptide 

O 

2 1 becomes available. 

pi The identification of the amino acid sequence of a peptide in 

accordance with the teachings of the present invention is preferably effected 
by comparison of the data collected by mass spectrometry, mass charge ratio 
and collision induced disintegration to putative data of mass spectrometry, 
mass charge ratio and collision induced disintegration of known proteins. 

As used herein in the specification and in the claims section below the 
10 term "peptide" includes native peptides (either degradation products or 
synthetically synthesized peptides) and further to peptidomimetics, such as 
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peptoids and semipeptoids which are peptide analogs, which may have, for 
example, modifications rendering the peptides more stable while in a body, or 
more immunogenic. 

Such modifications include, but are not limited to, cyclization, N 

5 terminus modification, C terminus modification, peptide bond modification, 
including, but not limited to, CH 2 -NH, CH 2 -S, CH 2 -S=0, 0=C-NH, CH2-O, 
CH2-CH2, S=C-NH, CH=CH or CF=CH, backbone modification and residue 
modification. Methods for preparing peptidomimetic compounds are well 
known in the art and are specified in Quantitative Drug Design, C.A. Ramsden 

10 Gd„ Chapter 17,2, F, Choplin Pergamon Press (1992), which is incorporated 
by reference as if fully set forth herein. Further detail in this respect axe 
provided hereinunder. 

As used herein in the specification and in the claims section below the 
term ''amino acid" is understood to include the 20 naturally occurring amino 

15 acids; those amino acids often modified post-translationally in vivo, including 
for example hydroxyproline, phosphoserine and phosphothreonine; and other 
unusual amino acids including, but not limited to, 2-aminoadipic acid, 
hydroxylysine, isodesmosine, nor-valine, nor-leucine and ornithine. 
Furthermore, the term "amino acid" includes both D- and L-amino acids. 

20 Further elaboration of the possible amino acids usable according to the present 
invention and examples of non-natural amino acids useful in MHC class I, type 
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A2„ recognizable peptide antigens are given hereinunder. Other anchor 
residues are known for other MHC molecules. 

Thus, assume the following positions (P1-P9) in a 9-mer peptide: 
P1-P2-P3-P4-P5-P6-P7-P8-P9 
5 The P2 and P9 positions include the anchor residues which are the main 

residues participating in binding to A2 MHC molecules. Amino acid resides 
engaging positions P2 and P9 are hydrophilic aliphatic non-charged natural 
amino (examples being Ala, Val, Leu, He, Gin, Thr, Ser, Cys, preferably Val 
and Leu) or of a non-natural hydrophilic aliphatic non-charged amino acid 

0 

S 10 (examples being norleucine (Nle), norvaline (Nva), oaminobutyric acid), 

IB 

H-! Positions PI and P3 are also known to include amino acid residues which 

Hi 
til 

g participate or assist in binding to MHC molecules, however, these positions 

^ can include any amino acids, natural or non-natural. The other positions are 

ill engaged by amino acid residues which typically do not participate in binding, 

ni 

*5 15 rather these amino acids are presented to the immune cells. Further details 

0 

M relating to the binding of peptides to MHC molecules can be found in 

reference 117, see Table V thereof, in particular. 

Hydrophilic aliphatic natural amino acids at P2 and P9 can be 
substituted by synthetic amino acids, preferably Nleu, Nval and/or o 
20 -aminobutyric acid. P9 can be also substituted by aliphatic amino acids 
of the general formula -HN(CH2) n COOH, wherein n - 3-5, as well as by 
branched derivatives thereof, such as, but not limited to, 
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-NH(CH 2 ) n -COON 

I 

R 

5 wherein R is, for example, methyl, ethyl or propyl, located at any one or more 
of the n carbons. 

The amino terminal residue (position PI) can be substituted by 
positively charged aliphatic carboxylic acids, such as, but not limited to, 
H2N(CH2) n COOH, wherein n - 2-4 and H 2 N~C(NH)-NH(CH2) n COOH, 

io wherein n = 2-3, as well as by hydroxy Lysine, N-methyl Lysine or ornithine 
(Orn), Additionally, the amino terminal residue can be substituted by enlarged 
aromatic residues, such as, but not limited to, H2N-(C6Hg)-CH2-COOH, 
p-aminophenyl alanine, H2N-F(NH)-NH-(C6H6)-CH 2 -COOH ? 

p-guanidinophenyl alanine or pyridinoalanine (Pal). These latter residues may 

15 form hydrogen bonding with the OH' moieties of the Tyrosine residues at the 
MHC-1 N-terminal binding pocket, as well as to create, at the same time 
aromatic-aromatic interactions. 

Derivatization of amino acid residues at positions P4-P8, should these 
residues have a side-chain, such as, OH, SH or NH2, like Ser, Tyr, Lys, Cys or 

20 Orn, can be by alky], aryl, alkanoyl or aroyl. In addition, OH groups at these 
positions may also be derivatized by phosphorylation and/or glycosylation. 
These derivatizations have been shown in some cases to enhance the binding 
to the T cell receptor. 
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Longer derivatives in which the second anchor amino acid is at position 
P10 may include at P9 most L amino acids. In some cases shorter derivatives 
are also applicable, in which the C terminal acid serves as the second anchor 
residue. 

5 Cyclic amino acid derivatives can engage position P4-P8, preferably 

positions P6 and P7. Cyclization can be obtained through amide bond 
formation, e.g., by incorporating Glu, Asp, Lys, Orn, di-amino butyric (Dab) 
acid, di-aminopropionic (Dap) acid at various positions in the chain (-CO-NH 
or -NH-CO bonds). Backbone to backbone cyclization can also be obtained 

10 through incorporation of modified amino acids of the formulas 
H-N((CH2) n -COOH)-C(R)H-COOH or H-N((CH 2 ) n -COOH)-C(R)H-NH2^ 



,g wherein n = 1-4, and further wherein R is any natural or non-natural side chain 



of an amino acid, As stated above, the data presented herein relates to the 
residues of the most abandoned MHC molecule - MHC class I, type A2. This 

is data was collected over the years via the detailed analysis of thousands of 
peptides that bind to MHC-I, A2. It will be appreciated that the method of the 
present invention allows the collection of data and analysis of peptides that 
bind any other to MHC molecule. 

Cyclization via formation of S-S bonds through incorporation of two 

20 Cys residues is also possible. Additional side-chain to side chain cyclization 
can be obtained via formation of an interaction bond of the formula 
-(-CH2-) n -S-CH2-C- ! , wherein n = 1 or 2, which is possible, for example. 
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through incorporation of Cys or homoCys and reaction of its free SH group 
with, e.g., bromoacetylated Lys, Orn, Dab or Dap. 

Peptide bonds (-CO-NH-) within the peptide may be substituted by 
N-methylated bonds (-N(CH 3 )-CO-)> ester bonds (-C(R)H-C-0-0-C(R)-N-), 
5 ketomethylen bonds (-CO-CH2-), o-aza bonds (-NH-N(R)-CO-), wherein R is 
any alkyl, e.g., methyl, carba bonds (-CH2-NH-), hydroxyethylene bonds 
(-CH(OH)-CH2-) ? thioamide bonds (-CS-NH-), olefinic double bonds 
(-CH=CH-), retro amide bonds (-NH-CO-), peptide derivatives 
(-N(R)-CH2-CO-)> wherein R is the "normal" side chain, naturally presented 
10 on the carbon atom. 

These modifications can occur at any of the bonds along the peptide 
chain and even at several (2-3) at the same time, Preferably, but not in all 
cases necessary, these modifications should exclude anchor amino acids. 

Natural aromatic amino acids, Trp, Tyr and Phe, may be substituted for 
15 synthetic non-natural acid such as TIC, naphthylelanine (Nol), ring-methylated 
derivatives of Phe, halogenated derivatives of Phe or o-methyl-Tyr, 

Tables 6-7 below list all of the naturally occurring amino acids (Table 
6) and some of the non-conventional or modifi ed amino acids (Table 7). 
20 TABLE 6 



Amino Acid 


Three-Letter Abbreviation 


One-letter Symbol 


Alanine 


Ala 


A 


Arginine 


Arg 


R 


Asparagine 


Asn 


N 


Aspartic acid 


Asp 


D 


Cyatebc 


Cys 


C 


Glutamine 


Gin 


0 


Glutamic Acid 


Glu 


E 


Glycine 


Gly 


G 
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Histidine 


His 


H 


Isoleucine 


lie 


I 


Leucine 


Leu 


L 


Lysine 


Lya 


K 


Methionine 


Met 


M 


Phenylalanine 


Phe 


P 


Proline 


Pro 


P 


Serine 


Ser 


S 


Threonine 


Thr 


T 


Tryptophan 


Tip 


W 


Tyrosine 


Tvr 


Y 


Valine 


Val 


V 


Any amino acid as above 


Xaa 


X 



r 


fable 7 


Non-conventional amino acid 


Code 


Non-conventional amino acid 


Code 


Ct-ftitiindbtirvric acid 


Abu 


L-N-methylalanine 


Nmata 


Ct-amino-Ot^methylbutyrate 


Meabu 


L-N-methylarginine 


Nniarg 


aminocyclopropane- 


Lpro 


L-ri -meiny tasparagine 


|N jTit*5>n 


carboxylate 




L-jN-metnyiaspanic acio 


IN III aap 


aminnisohutvric acid 


Aib 


L-N-m ethy (cysteine 


Nmcys 


aminonorbornyl- 


Norb 


L-N-methylglutamine 


Nmgin 


carboxylate 




L-N-methylfilutamic acid 


Nmglu 


cyclohexylalanine 


Chexa 


L-N-methy (histidine 


Nmhis 


cyclopentylalanine 


Cpen 


L-N-methylisolleucine 


Nmile 


D- alanine 


jjai 


jl-in -m cmyiieu cm e 




0-arginine 


Darg 


L-N-methyllysine 


Nnilys 


D-a$partic acid 


Dasp 


L-N-metbylmethionine 


Nmmet 


D-cysteine 


Dcys 


L-N-methytnorleucine 


Nmnle 


D-&lutarnine 


Dgln 


L-N-metbylnorvaline 


Nmnva 


D-glutamic acid 


Dglu 


Lj-in -mctny lonuiu me 


TvTtyi Am 


D-h.istidine 


Dhis 


L-N-niethylphenylalanine 


Nmphe 


D-isoleucine 


Dilc 


L-N-methylproline 


Nmpro 


D-leucine 


Dleu 


L-N-methylserine 


Nniser 


D-Jysine 


Dlys 


L-N-methylthreonine 


Nmthr 


D-methionine 


Dmet 


L-N -methyl tryptophan 


Nmtrp 


D-omithine 


Dom 


L-N-methyltyrosine 


Nmtyr 


D-phenylalanine 


Dphe 


L-N~methyl valine 


Nmval 


D-proline 


Dpro 


L-N-methylethylglycine 


Nmetg 


D-serine 


Dser 


L-N-methyM-butyJ glycine 


Nmtbug 


D-threonine 


Dthr 


L-nor leucine 


Nle 


D-tryptopban 


Dtrp 


L-norvaline 


Nva 


D-tyrosine 


Dtyr 


a-methyl-aminoisobutyrate 


Maib 


D-valine 


Dval 


a-methyl-Y-aminobutyrate 


Mgabu 


D-a-methylalanine 


Dmala 


a-methylcyclohexylalanine 


Mchexa 


D-a-rnethylarginine 


Dmarg 


a-methylcycJopentylalanine 


Mcpen 


D-a-methylasparagine 


Dmasn 


a-metbyl-a-napthylalanine 


Manap 


D-a-metbylaspartate 


Dmasp 


a-methylpenicillamine 


Mpen 


D-Ot-methylcysteine 


Dmcys 


N'(4-aminobutyl)glycine 


Ngiu 


D-a-mcthylglutamLne 


Dmgln 


N-(2-aminoethyl)glycine 


Naeg 


D-a-mcthylhistidine 


Dmhis 


N-(3-aminopropyl)glycine 


Norn 
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D-a-methy!isoleucine 


Dmile 


N- amino-a-methylbutyrate 


Nmaabu 


D-a-methylleucine 


Dmleu 


a-napthylalanine 


Anap 


B-a-methyllysine 


Dmlys 


N-benzylgiycine 


Nphe 


D-Ctnnethy Methionine 


Dmmet 


N-(2-carbamyletbyl)glycine 


Ngln 


D-a-methylomithine 


Dmorn 


N-(carhamylmethyl)glycine 


Nasn 


D-a-methylpheuylalanwe 


Drophe 


N-(2-carboxyethy])glycine 


Nglu 


D-a-raethylproline 


Dmpro 


N^carboxymethyOglycirie 


Nasp 


D-a-methylserine 


Denser 


N-cyciobutylglycioe 


Ncbut 


D-a-methylthrecmine 


Dmthr 


N-cycloheptylglycine 


Nchep 


D-a-methyltryptophan 


Dmtrp 


N-cyclohexy]g1ycine 


Nchex 


D-a-methyltyrosine 


Dunty 


N-cyc3odecylg!ycine 


Ncdec 


D-Ol-methy 1 val in e 


Dmval 


N-cyclododecJglycine 


Ncdod 


D-Dt-methvlalnine 


Dnmala 


N-cycJoocrylgiycine 


Ncoct 


D-Cc-methy] ar^inine 




ldDrflu vl £rl vc ine 


Ncpro 


D-a-inethylasparagine 


Dnmasn 


N-cycloimdecylglycine 


Ncund 


D-a-methylasparatate 


Dnmasp 


N-(2 1 2-diphenylethyl)glycine 


Nbhm 


D-a-methylcysteine 


Dnmcys 


N-(3^-dipheny1prapyl)glycme 


Nbhe 


D-N-methylleucine 


Dnmleu 


N-(3-indolylyetbyl) glycine 


Nhtrp 


D-N-methyllysinfi 


Dnmlys 


M -m eth v 1 -V-fl m in n but vrAtft 


Nmgabu 


in • iTiciriy i cy cj on csxy i a i an in e 




T^«\J«iyi A^hv 1 vn prti i n n l n p 
i^*j>t"ijicu.iyiuicu]iuiijxic 


JVJ 11 LiiJlvl 


D -N-methyl onu thine 


jjniTiur n 


iN-mcmyicyMuponiYi&iqJiinc 




M-metnyigiycine 


JNaJa 


jj"iN ■me uiy j pcLeny i aj an ine 


U lull i\C 


N-meihy I amino isobutyrate 


Nmaib 


D-N«methylproline 


Dnmpro 


N-( 1 -metbylpropyl)glycjne 


Nile 


D-N-methylserine 


Dnmser 


N-(2-methyl propyl)glycine 


Nile 


D-N-methyl serine 


Dnmser 


N-(2-methylpropyl)g1ycine 


Nleu 


D-N-methylthreonine 


Dnmthr 


D-N-methyltryptophaji 


Dnmtrp 


N-(l -rnethylethyl)glycine 


Nva 


D-N-methyltyros ine 


Dnmtyr 


M-methyla-napthy IaJ an ine 


Nmanap 


D-N-methyl valine 


Dnmval 


N-methylpenici 11 amine 


rimpcn 


7-aminobutyric acid 


Gabu 


N-f D-hvdroxvDhenvnc] vctne 


Nhtyr 




1. uug 


N-(thioTnethyl)glycine 


Ncys 


L-ethylglycine 


Etg 


penicillamine 


Pen 


L-homophenylalanine 


Hphe 


L-a-Jtiethylalanine 


Mala 




Marg 


L-ct-methylasparagine 


Masn 


L-a-metliylaspartate 




L"Ct- methy 1-t-buty Igly cine 


Mtbug 


L-ct-metbylcysteine 


Mcys 


L-methy lethylgly cine 


jvieig 


L-a-methylglutaraine 


Mgln 


L-a-itiethy )g lutam ate 


Mglu 


L-a-methylhistidine 


Mhis 


L-a-methylhorao phenylalanine 


Mhphe 


L-ct-met3iy li soleucine 


Mile 


N-(2"methyltbioethyl)glycme 


Nmet 


T^-^-in pfh vl P 1 ut 3m i n 


Dnrngln 


N-(3-guanidinopropyl)glycuie 


Narg 


D-N-methylglutamate 


Dnmglu 


N-(l -hydroxyethy!)glycine 


Ntbr 


D-N-methylhistidine 


Drmnhls 


N-(hydroxyethyl)glycine 


Nser 


D-N-methylisoleuctne 


Dnmile 


N-(imidazoWlethyl)glycine 


Nhis 


D-N-meihylleucine 


Dnmleu 


N-(3-indolylyetbyl)glycme 


Nhtrp 


D-N-methyllysine 


Dnmlys 


N-nicthyl-Y-aminobutyrate 


Nmgabu 


N-methylcyclobexylalonine 


Nmchexa 


D-N-methylmethionine 


Dnmmet 


D-N-methylornithine 


Dnmom 


N-methylcyclopentylalanine 


Nmcpen 


N-methyl£lycine 


Nala 


D-M -methylphenylalanine 


Dnniphe 
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N-memylaminoisobutyrate 


Nmaib 


D-N-methylproline 


Dnmpro 


N-0 -methy lpropyl)ftIycme 


Nile 


D-N-methylserine 


Dnmser 






j^-i\-iJicuiyui.ucuoJiit? 




D-N-methyltryptophan 


Dmntrp 


N-(I -metbylethy l)glycine 


NvaJ 


D-N-methyltyrosine 


Dnmtyr 


N-methyla-napthylaJanine 


Nraanap 


D-N-methylvaline 


Dnmval 


N-methy!penicil!amine 


Nmpen 


Y-aminobutyric acid 


Gabu 


N-(p-hydroxyphenyl)glycine 


Nhtyr 


L-t-butylglycine 


Tbug 


N-(thiomethyl)filycine 


Ncys 


L-ethylglycine 


Et& 


penicillamine 


Pen 


L-ho monhen v 1 al&n i ne 


Hphe 


L-Ot-methy laianin e 


Mala 


L-a-methylarginine 


Marff 


L-Ct-methylasparagine 


Masn 


L-ct-methylaspaitate 


Ma$p 


L-tt-methyl-t-butylglycine 


Mtbug 


L-a-metbylcystewe 




L-m eth vl eth vl £ ivcine 


Metg 


L-Ot-methylglutamine 




L-Ot-methylglutamate 


Mglu 


L-a-methylhistidine 




L-d-methylhomophenylalanine 


Mhphe 


L-a-metbyli&oleucine 


Milp 

4.YIJJC 


N-^-methvlthi oethvn&lvcine 


Nmet 


L-a-methylleucine 


Mien 


L-a-methyJlystne 


Mlys 


L-a-methylnethiotiixie 


Mmet 


L-Ot-methylnorleucine 


Mnle 


j^r\jv iiiwujjr uiwi >*4fijfv 


Mnva 


L-ct-methy 1 ornithine 


Morn 


L-a-methylphenytalaiiine 


Mphe 


L-Ct-methylproline 


Mpro 


L-Ot-methylserine 


mser 


jj-cx-me my Jinreonin e 


Mthr 


L-a-methylvaline 


Mtrp 


L-a-methyltyrosine 


Mtyr 


L-a-methylleucine 


Mval 
Nnbhm 


L-N-methylhomophenyJalanine 


Nmhphe 


N-(N-{2,2-di&henYlethvl) 




N-fN-O^-diphenvlpropyl) 




cflrbamylmethyl-glycine 


Nnbhm 


carbamylmethyK I )glycioe 


Nnbhe 


1 -carboxy- 1 -(2,2-diphenyl 
ethylamino)cyclopropane 


Nmbc 







A peptide according to the present invention can be used in a self 
standing form or be a part of a larger moiety such as a protein or a display 
moieties such as a display bacterium, a display phage or preferably a display 
cell. 

Additionally, a peptide according to the present invention includes at 
least five, optionally at least six, optionally at least seven, optionally at least 
eight, optionally at least nine, optionally at least ten, optionally at least eleven, 
optionally at least twelve, optionally at least thirteen, optionally at least 
fourteen, optionally at least fifteen, optionally at least sixteen or optionally at 
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least seventeen, optionally between seventeen and twenty five or optionally 
between twenty five and at least thirty amino acid residues (also referred to 
herein interchangeably as amino acids). 

Accordingly, as used herein the term "amino acid" or "amino acids" is 
5 understood to include the 20 naturally occurring amino acids; those amino 
acids often modified post-translationally in vivo, including, for example, 
hydroxyproline, phosphoserine and phosphothreonine; and other unusual 
amino acids including, but not limited to, 2-aminoadipic acid, hydroxylysine, 
m isodesmosine, nor-valine, nor-leucine and ornithine. Furthermore, the term 

g 10 "amino acid" includes both D- and L-amino acids. 

As used herein the phrase "derived from a protein" refers to peptides 
derived from the specified protein or proteins and further to homologous 
peptides derived from equivalent regions of proteins homologous to the 
specified proteins of the same or other species, provided that these peptides are 
15 effective as vaccines, such as anti-tumor vaccines. The term further relates to 
permissible amino acid alterations and peptidomimetics designed based on the 
amino acid sequence of the specified proteins or their homologous proteins. 

As used herein the phrase "anti-tumor vaccines" refers to a vaccines 
effective in preventing the development of, or curing, cancer, including 
20 primary tumor and/or metastases. 

The peptides of the invention can be administered per se or as an active 
ingredient in a pharmaceutical composition which may further include a 
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pharmaceutical^ acceptable carrier. Preferably, one or more peptides of the 
invention are presented in context of an antigen presenting cell. The most 
common cells used to load antigens are bone marrow and peripheral blood 
derived dendritic cells (DC), as these cells express costimulatory molecules 
5 that help activation of CTL. Nevertheless, the peptide presenting cell can also 
be a macrophage, a B cell or a fibroblast. Presenting the peptide can be 
effected by a variety of methods, such as, but not limited to, (a) transforming 
the presenting cell with at least one polynucleotide (e.g,, DNA) encoding at 
least one peptide; (b) loading the presenting cell with at least one 
10 polynucleotide (e.g., RNA) encoding at least one peptide; (c) loading the 
presenting cell with at least one peptide (e.g., synthetic peptide); and (d) 
loading the antigen presenting cell with at least one longer polypeptide (e.g., 
purified) including at least one peptide. Loading can be external or internal. 
The polynucleotide, peptide or longer polypeptide can be fused to internalizing 
S? 15 sequences* antennapedia sequences or toxoid sequences or to helper 

sequences* such as, but not limited to, heat shock protein sequences. 

As used herein a "pharmaceutical composition" refers to a preparation 
of one or more of the peptides described herein, with other chemical 
components such as pharmaceutically suitable carriers and excipients. The 
20 purpose of a pharmaceutical composition is to facilitate administration of a 
compound to a subject. 
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Hereinafter, the term "pharmaceutically acceptable carrier" refers to a 
carrier or a diluent that does not cause significant irritation to a subject and 
does not abrogate the biological activity and properties of the administered 
compound. Examples, without limitations, of carriers are propylene glycol, 
5 saline, emulsions and mixtures of organic solvents with water. 

Herein the term "excipient" refers to an inert substance added to a 
pharmaceutical composition to further facilitate administration of a compound. 
Examples, without limitation, of excipients include calcium carbonate, calcium 
phosphate, various sugars and types of starch, cellulose derivatives, gelatin, 
10 vegetable oils and polyethylene glycols, 

i 

yi According to a preferred embodiment of the present invention, the 

jp pharmaceutical carrier is an aqueous solution of lactic acid. 

In this respect, it should be pointed out that some of the peptides of the 
present invention, according to preferred embodiments, are readily soluble in 
15 aqueous media and are therefore easily formulated. 

Techniques for formulation and administration of drugs may be found 
in "Remington's Pharmaceutical Sciences," Mack Publishing Co., Easton, PA, 
latest edition, which is incorporated herein by reference. 

Suitable routes of administration may, for example, include oral, rectal, 
20 transmucosal, transdermal, intestinal or parenteral delivery, including 
intramuscular, subcutaneous and intramedullary injections as well as 
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intrathecal, direct intraventricular, intravenous, intraperitoneal, intranasal, or 
intraocular injections. 

Pharmaceutical compositions of the present invention may be 
manufactured by processes well known in the art, e.g., by means of 
5 conventional mixing, dissolving, granulating, dragee-making, levigating, 
emulsifying, encapsulating, entrapping or lyophilizing processes. 

Pharmaceutical compositions for use in accordance with the present 
invention thus may be formulated in conventional manner using one or more 
pharmaceutical^ acceptable carriers comprising excipients and auxiliaries, 

JS? io which facilitate processing of the active compounds into preparations which 

*&? 

fn 

frj can be used pharmaceutical^. Proper formulation is dependent upon the route 

> of administration chosen. 

03 

a For injection, the peptides of the invention may be formulated in 

Issf 

III aqueous solutions, preferably in physiologically compatible buffers such as 

*0 15 Hank's solution, Ringer's solution, or physiological saline buffer with or 
?R& without organic solvents such as propylene glycol, polyethylene glycol and the 

like. For transmucosal administration, penetrants are used in the formulation. 
Such penetrants are generally known in the art. 

For oral administration, the peptides can be formulated readily by 
20 combining the active compounds with pharmaceutical^ acceptable carriers 
well known in the art. Such carriers enable the peptides of the invention to be 
formulated as tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, 
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suspensions, and the like for ^ ■ 
Preparations for oral u«e i. , 

gnndmg the resulting mixture anw 

g mixture, and processing the raixture ^ ^ 

l-ose, sucrose, mannitol, or sorbitol; cellulose preparations Such ^ fof 
example, maize starch, wheat starch, rice starch, potato starch, gelatin, gum 
tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, sodium 
carbomethylcellulose and/or physiologically acceptable polymers such as 
10 polyvinylpyrrolidone (PVP). If desired, disintegrating agents may be added, 
such as cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt 
thereof such as sodium alginate. 

Dragee cores are provided with suitable coatings. For this purpose, 
concentrated sugar solutions may be used which may optionally contain gum 
is arabic, talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, titanium 
dioxide, lacquer solutions and suitable organic solvents or solvent mixtures. 
Dyestuffs or pigments may be added to the tablets or dragee coatings for 
identification or to characterize different combinations of active compound 
doses. 

20 Pharmaceutical compositions, which can be used orally, include 

push-fit capsules made of gelatin as well as soft, sealed capsules made of 
gelatin and a plasticizer, such as glycerol or sorbitol. The push-fit capsules 
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may contain the active ingredients in admixture with filler such as lactose, 
binders such as starches, lubricants such as talc or magnesium stearate and, 
optionally, stabilizers. In soft capsules, the active compounds may be 
dissolved or suspended in suitable liquids, such as aqueous solution, fatty oils, 
5 liquid paraffin, or liquid polyethylene glycols. In addition, stabilizers may be 
added. All formulations for oral administration should be in dosages suitable 
for the chosen route of admiiiistration. 

For buccal administration, the compositions may take the form of 
tablets or lozenges formulated in conventional manner, 
g io For administration by inhalation, the peptides are conveniently 

rq delivered in the form of an aerosol spray presentation from a pressurized pack 

II! 

J or a nebulizer with the use of a suitable propellant, e.g., 

0 

g dichlorodifluoromethane, trichlorofluoromethane, dichloro-tetrafluoroethane 

0 

or carbon dioxide. In the case of a pressurized aerosol, the dosage unit may be 
15 determined by providing a valve to deliver a metered amount* Capsules and 
cartridges of, e.g., gelatin for use in an inhaler or insufflator may be 
formulated containing a powder mix of the compound and a suitable powder 
base such as lactose or starch. 

The peptides described herein may be formulated for parenteral 
20 administration, e.g., by bolus injection or continuos infusion. Formulations for 
injection may be presented in unit dosage form, e.g., in ampoules or in 
multidose containers with optionally, an added preservative. The compositions 
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may be suspensions, solutions or emulsions in oily or aqueous vehicles, and 
may contain formulatory agents such as suspending, stabilizing and/or 
dispersing agents. 

Pharmaceutical compositions for parenteral administration include 
5 aqueous solutions of the active compound in water-soluble form. 
Additionally, suspensions of the active compounds may be prepared as 
appropriate oily injection suspensions. Suitable lipophilic solvents or vehicles 
include fatty oils such as sesame oil, or synthetic fatty acids esters such as 
p ethyl oleate, triglycerides or liposomes. Aqueous injection suspensions may 

y9 

l| io contain substances, which increase the viscosity of the suspension, such as 

y » 

y! sodium carboxymethyl cellulose, sorbitol or dextran. Optionally, the 

suspension may also contain suitable stabilizers or agents which increase the 
solubility of the peptides to allow for the preparation of highly concentrated 
solutions. 

15 Alternatively, the active ingredient may be in powder form for 

constitution with a suitable vehicle, e.g,, sterile, pyrogen-free water, before 
use. 

The peptides of the present invention may also be formulated in rectal 
compositions such as suppositories or retention enemas, using, e.g., 
20 conventional suppository bases such as cocoa butter or other glycerides. 

The pharmaceutical compositions herein described may also comprise 
suitable solid of gel phase carriers or excipients. Examples of such carriers or 
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excipients include, but are not limited to, calcium carbonate, calcium 
phosphate, various sugars, starches, cellulose derivatives, gelatin and polymers 
such as polyethylene glycols. 

Pharmaceutical compositions suitable for use in context of the present 
5 invention include compositions wherein the active ingredients are contained in 
an amount effective to achieve the intended purpose. More specifically, a 
therapeutically effective amount means an amount of peptide effective to 
prevent, alleviate or ameliorate symptoms of pathology or prolong the survival 
of the subject being treated, 
]0 Determination of a therapeutically effective amount is well within the 

capability of those skilled in the art, especially in light of the detailed 
disclosure provided herein. 

For any peptide used in the methods of the invention, the therapeutically 
111 effective amount or dose can be estimated initially from activity assays in cell 

15 cultures and/or animals, For example, a dose can be formulated in animal 
models to achieve a circulating concentration range that includes the IC50 as 
determined by activity assays (e.g., the concentration of the test compound, 
which achieves a half-maximal inhibition of the proliferation activity). Such 
information can be used to more accurately determine useful doses in humans. 
20 Toxicity and therapeutic efficacy of the peptides described herein can 

be determined by standard pharmaceutical procedures in experimental animals, 
e.g., by determining the IC50 and the LD50 (lethal dose causing death in 50 % 
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of the tested animals) for a subject compound. The data obtained from these 
activity assays and animal studies can be used in formulating a range of dosage 
for use in human. 

The dosage may vary depending upon the dosage form employed and 
the route of administration utilized. The exact formulation, route of 
administration and dosage can be chosen by the individual physician in view of 
the patient's condition. (See e.g., Fingl, et aL, 1975, in "The Pharmacological 
Basis of Therapeutics", Ch. 1 pJ). 

Dosage amount and interval may be adjusted individually to provide 
plasma levels of the active moiety which are sufficient to maintain therapeutic 
effects, termed the minimal effective concentration (MEC). The MEC will 
vary for each preparation, but can be estimated from in vitro and/or in vivo 
data, e.g., the concentration necessary to achieve 50-90 % inhibition of a 
proliferation of certain cells may be ascertained using the assays described 
herein. Dosages necessary to achieve the MEC will depend on individual 
characteristics and route of administration. HPLC assays or bioassays can be 
used to determine plasma concentrations. 

Dosage intervals can also be determined using the MEC value. 
Preparations should be administered using a regimen, which maintains plasma 
levels above the MEC for 10-90 % of the time, preferable between 30-90 % 
and most preferably 50-90 %. 
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of the tested animals) for a subject compound. The data obtained from these 
activity assays and animal studies can be used in formulating a range of dosage 
for use in human. 

The dosage may vary depending upon the dosage form employed and 
5 the route of administration utilized. The exact formulation, route of 
administration and dosage can be chosen by the individual physician in view of 
the patient's condition. (See e.g., Fingl, et al., 1975, in "The Pharmacological 
Basis of Therapeutics", Ch, 1 p.l). 

Dosage amount and interval may be adjusted individually to provide 
10 plasma levels of the active moiety which are sufficient to maintain therapeutic 

effects, termed the minimal effective concentration (MEC). The MEC will 

111 

*P vary for each preparation, but can be estimated from in vitro and/or in vivo 
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data, e.g., the concentration necessary to achieve 50-90 % inhibition of a 

proliferation of certain cells may be ascertained using the assays described 
is herein. Dosages necessary to achieve the MEC will depend on individual 

characteristics and route of administration. HPLC assays or bioassays can be 

used to determine plasma concentrations. 

Dosage intervals can also be determined using the MEC value. 

Preparations should be administered using a regimen, which maintains plasma 
20 levels above the MEC for 10-90 % of the time, preferable between 30-90 % 

and most preferably 50-90 %. 
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Depending on the severity and responsiveness of the condition to be 
treated, dosing can also be a single administration of a slow release 
composition described hereinabove, with course of treatment lasting from 
several days to several weeks or until cure is effected or diminution of the 
5 disease state is achieved. 

The amount of a composition to be administered will, of course, be 
dependent on the subject being treated, the severity of the affliction, the 
manner of administration, the judgment of the prescribing physician, etc. 

Compositions of the present invention may, if desired, be presented in a 
10 pack or dispenser device, such as a U.S. Food and Drug Administration (FDA) 
approved kit, which may contain one or more unit dosage forms containing the 
active ingredient. The pack may, for example, comprise metal or plastic foil, 
such as a blister pack. The pack or dispenser device may be accompanied by 
instructions for administration. The pack or dispenser may also be 
15 accompanied by a notice associated with the container in a form prescribed by 
a governmental agency regulating the manufacture, use or sale of 
pharmaceuticals, which notice is reflective of approval by the agency of the 
form of the compositions or human or veterinary administration. Such notice, 
for example, may be of labeling approved by the U.S. Food and Drug 
20 Administration for prescription drugs or of an approved product insert. 
Compositions comprising a chemical conjugate of the invention formulated in 
a compatible pharmaceutical carrier may also be prepared, placed in an 
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appropriate container, and labeled for treatment of an indicated condition, such 
as a cancer of a certain type, an autoimmune disease or an allergy, 

Peptides of the present invention may be packaged in kits, each such kit 
comprising a plurality of individual containers, each of which containing at 
least one peptide identified by the method of the present invention. Such a kit 
can be used for two purposes. First, an in vitro functional assay, such as the 
CTL assay [20] or the ELISPOT assay [19] of, for example, cytokine (e.g., 
EL-2, TNF alpha or interferon gamma) production or development of 
cytotoxiciy using immune cells derived from a patient can be used to 
determine the immune response of the patient to each one of the peptides, 
which response to a large extent depends on the particular MHC haplotype of 



c p the patient. Second, once, a reactive peptide or peptides axe identified, either 

IS 

by an individualized in vitro assay or from in silico data as is further detailed 



below, suitable peptide or peptides from the kit can be administered so as to 
treat the patient. 

According to another aspect of the present invention there is provided 
an electronic data storage device, storing, in a retrievable form, a plurality of 
sequences of peptides identified by the method described herein. Various 
other parameters, such as the parameters identified in the Tables provided in 
the Examples section that follows, can also be linked to the peptide sequences, 
in, for example, a table form or any other form. Preferably, the plurality of 
peptides are arranged at least according to their association with a pathology 
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and further according to their ability of binding to MHC molecules of a 
particular haplotype. This in silico data can be u$ed instead or in addition to 
the in vitro assays described above to match a most active peptide to treat a 
pathology of a certain patient having a particular pre identified MHC 
haplotype. Thus, look up tables associating a peptide with a protein with a 
gene, with a disease with a haplotype, and/or with an efficiency score can be 
constructed and used to best suit a peptide for treatment of a disease in an 
individualized way taking into account the MHC haplotype of the patient to be 

^ treated. Of course, individualized in vitro assays can be used to ascertain 

*?. 

*J peptide selection. 

m 

The electronic data storage device can, for example, be an 

in 

1c! electromagnetically or electro-optically readable device and it preferably forms 

, 2 a part of a server that is accessible by users through a communications 

G 

III network, such as the Internet, an intranet or an extranet, via a plurality of user 

til 

*0 clients at the disposal of the users, A management software application 

O 

^ manages the data stored in the data storage device and is preferably designed 

to support search and retrieval of information from the database and deposition 
of information into the database. 

Thus, further according to the present invention there is provided a 
method of eliciting an immune response against a protein of interest in a 
subject having a particular MHC haplotype. The method according to this 
aspect of the invention is effected by determining the subjects particular MHC 
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haplotype; and administering to the subject an effective amount of at least one 
peptide derived from the protein of interest and which is capable of binding to 
MHC molecules of the particular haplotype. 

Still further according to the present invention there is provided a 
method of eliciting an immune response against a protein of interest in a 
subject. The method is effected by using an individualized in vitro assay for 
determining an immune reactivity of an immune system of the subject to a 
plurality of peptides derived from the protein of interest; and administering to 
the subject an effective amount of at least one peptide derived from the protein 
of interest and which is capable of inducing predetermined sufficient immune 
reactivity. 

According to another aspect the present invention provides a method of 
treating a pathology by eliciting an immune response against a protein of 
interest in a subject having a particular MHC haplotype. The method is 
effected by determining the subject's particular MHC haplotype; and 
administering to the subject a therapeutic effective amount of at least one 
peptide derived from the protein of interest and which is capable of binding to 
MHC molecules of the particular haplotype. 

As used herein the term "treating" includes prevention or cure of a 
pathology, such as a disease, syndrom or manifestation, effected by inhibiting, 
slowing or reversing the progression of the disease, syndrom or manifestation, 
substantially ameliorating clinical symptoms of a disease, syndrom or 
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manifestation or substantially preventing the appearance of clinical symptoms 
of a disease, syndrom or manifestation. 

As used herein the term "subject" refers to humans and animals having 
an MHC system, such as the HLA system in humans, in particular farm 
5 animals. It will be appreciated in this respect that the method of the present 
invention can be used to improve all kinds of peptide immunization via 
individualization for both human beings and animals. 

A variety of pathologies can be treated using the peptides of the present 
invention, including, but not limited to, cancers, infections, inflammations, 
ft 10 autoimmune diseases, allergies, etc. The gist of the present invention with 

Ul respect to treating pathologies lies in the fact that the present invention offers, 

S si; 
W : 

*P for the first time, individualization of the vaccine to the MHC haplotype of the 

treated subject. 

Additional objects, advantages, and novel features of the present 
|S 15 invention will become apparent to one ordinarily skilled in the art upon 

examination of the following examples, which are not intended to be limiting. 
Additionally, each of the various embodiments and aspects of the present 
invention as delineated hereinabove and as claimed in the claims section below 
finds experimental support in the following examples, 
20 EXAMPLES 

Reference is now made to the following examples, which together with 
the above descriptions, illustrate the invention in a non limiting fashion. 
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Generally, the nomenclature used herein and the laboratory procedures 
utilized in the present invention include molecular, biochemical, 
microbiological and recombinant DNA techniques. Such techniques are 
thoroughly explained in the literature. See, for example, "Molecular Cloning: 
5 A laboratory Manual" Sambrook et aL, (1989); "Current Protocols in 
Molecular Biology" Volumes I-III Ausubel, R. M, ed. (1994); Ausubel et aL, 
"Current Protocols in Molecular Biology", John Wiley and Sons, Baltimore, 
Maryland (1989); Perbal, "A Practical Guide to Molecular Cloning", John 
n Wiley & Sons, New York (1988); Watson et aL, "Recombinant DNA", 

"SSSf 
' **$ 

m io Scientific American Books, New York; Birren et aL (eds) "Genome Analysis: 

y & 

ff| A Laboratory Manual Series", Vols. 1-4, Cold Spring Harbor Laboratory 

ill 

£ Press, New York (1998); methodologies as set forth in U,S. Pat. Nos. 

r 4,666,828; 4,683,202; 4,801,531; 5,192,659 and 5,272,057; "Cell Biology: A 

jjl Laboratory Handbook", Volumes I-III Cellis, J. E., ed. (1994); "Culture of 

% 15 Animal Cells - A Manual of Basic Technique" by Freshney, Wiley-Liss, N. Y. 

(1994), Third Edition; "Current Protocols in Immunology" Volumes I-ITJ 
Coligan J. E., ed. (1994); Stites et al. (eds), "Basic and Clinical 
Immunology" (8th Edition), Appleton & Lange, Norwalk, CT (1994); Mishell 
and Shiigi (eds), "Selected Methods in Cellular Immunology", W. H. 
20 Freeman and Co., New York (1980); available immunoassays are extensively 
described in the patent and scientific literature, see, for example, U.S. Pat. 
Nos. 3,791,932; 3,839,153; 3,850,752; 3,850,578; 3,853,987; 3,867,517; 
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3,879,262; 3,901,654; 3,935,074; 3,984,533; 3,996,3*5; 4,034,074; 4,098,876; 
4,879,219; 5,011,771 and 5,281,521; "Oligonucleotide Synthesis" Gait, M, J., 
ed. (1984); "Nucleic Acid Hybridization" Hames, B. D., and Higgins S, J., 
eds. (1985); "Transcription and Translation" Hames, B. D., and Higgins S. J., 
5 eds. (1984); "Animal Cell Culture" Freshney,R. L, ed. (1986); "Immobilized 
Cells and Enzymes' 1 IRL Press, (1986); "A Practical Guide to Molecular 
Cloning" Perbal, B„ (1984) and "Methods in Enzymology" Vol 1-317, 
Academic Press; "PCR Protocols: A Guide To Methods And Applications", 
P Academic Press, San Diego, CA (1990); Marshak et al., "Strategies for Protein 

03 10 Purification and Characterization - A Laboratory Course Manual" CSHL Press 

H' (1996); all of which are incorporated by reference as if folly set forth herein. 

if! 

•ST u 

g Other general references are provided throughout this document. The 

p procedures therein are believed to be well known in the art and are provided 

pj for the convenience of the reader. All the information contained therein is 

O 1 5 incorporated herein by reference. 

MATERIALS AND EXPERIMENTAL PROCEDURES 
Cell lines: 

The human cancer cell lines: PC3 (prostate cancer), UCI-107 and 
UCM01 (both ovarian cancer), MDA-231 and MCF-7 (both breast cancer) 
20 were obtained from the ATCC. The human B-cell line C1R was a generous 
gift from Nick Zavazava. UCI-107, UCI-101, MDA-231 and MCF^7 cells 
were maintained in DMEM containing 10 % FCS, 1 mM glutamine, 0.1 mg/ml 



29-MAY-2001 21:07 FROM: 



TO: 972 9 7706333 



P. 059 



PI 
ill 

TP S 

ill 



M 

yi 



58 

streptomycin and 100 units/ml penicillin. PC3 and C1R cells were maintained 
in RPMI 1640 containing 10 % FCS, 1 mM glutamine, 0.1 mg/ml 
streptomycin and 100 units/ml penicillin. For growing MCF-7 cells without 
estrogen, the cells were maintained in DMEM without sodium pyruvate and 

5 phenol red and containing 4 % FCS stripped of estrogen, 1 mM glutamine, 0. 1 
mg/ml streptomycin and 100 units/ml penicillin. Culture media, and serum 
were obtained from GibcoBRL. 

RMA-S-HHD is a murine TAP-2 deficient lymphoma clone of 
C57BL/6 origin, transfected with HLA-A2.1/Db-(32m single chain (HUD) 

10 [23]. The RMA-S-HHD-B7.1 cells transfected by the murine B7.1 
costimulatory molecule (CD80). EL4-HHD is the murine lymphoma cell line 
EL4 transfected by HHD. RMA-S-HHD, RMA-S-HHD-B7 1 and EL4-HHD 
were maintained in RPMI 1640 containing 10 % FCS and 1 mM glutamine, 
0.1 mg/ml streptomycin and 100 units/ml penicillin. After transfection, the 

15 cells were maintained in medium supplemented with 500 to 1000 ng/ml of the 
antibiotic G418 (GibcoBRL). 
DNA: 

Plasmid HLA-A2/Q10b usc d for expression of soluble MHC, contains 
the first five exons of the HLA-A2 fused to exons 5 to 8 of the murine mutant 
20 QlOb, which lacks a functional transmembrane domain and is therefore 
secreted. This plasmid was a generous gift from D. Margulies, of the NTH 
[24]. Plasmid (php2m) was constructed to express the human 
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^-microglobulin. It is based on the cDNA of human p2m (hp2m) isolated 
from PC3 cells and amplified by PCR using the following primers: 5 '-sense 
primer: 5-AGATTCCC AAGCTTATGTCTCGCTCCGTGG-3 ' (SEQ ID 
NO:40) contained a restriction site for Hind III before the signal peptide and a 
5 3' antisense primer 5-AGCTAGTCTAGATTATCACATGTCTC 
GATCCC ACTTAAC-3 r (SEQ ID NO:41) contained the restriction site for 
Xbal on the 3' end of p2m. The purified PCR product was cut with HindRl 
and Xbal and ligated into the eukaryotic expression vector pCDNA-3.1 
p (Invitrogen). Plasmid sHLA-A2 and sHLA-B7 contains the cDNA of the first 

so. 

63 10 4 exons of this alleles ligated into the plasmid pcDNA3 . 1 [34]. 

01 

Jjj Antibodies and hybridomas: 

1/1 

if The hybridomas W6/32 and BB7.2, an anti-MHC class-I and 

p anti-HLA-A2 respectively, and HB-149 an anti P2m were obtained from the 

ft 

!=y ATCC The antibodies were affinity purified using protein A-Sepharose 

O 15 CL-4B (Sigma) from mouse ascites fluid. 

Transfection of cancer cells and selection of clones secreting sMHC: 
Cell lines were co-transfected with plasmid HLA-A2/Q10 b and with 
php2m, which conferred resistance to the antibiotic G418 or transfected only 
with the plasmids SHLA-A2 and sHLA-B7 that contained the antibiotic 
20 resistance. Cells were electroporated by use of a Gene Pulser (Bio-Rad) set at 
280-300 mV 960 jiF. Transfected cell clones were selected in G418 antibiotic 
and screened for those secreting sMHC to the growth medium. Secretion of 
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sMHC was assayed by sandwich-ELISA with plates coated with the mAb 
BB7.2 (for sMHC-A2) or by HB-149 (for sMHC-B7) and sMHC was detected 
with the biotinilated mAb W6/32. Color was developed with ABTS (Sigma) 
catalyzed by streptavidin peroxidase (Sigma). 

5 Affinity purification of soluble MHC: 

Cultured cells, expressing the soluble MHC were grown to confluency 
in 150 mm plates. The culture medium was collected and residual cells were 
removed by centrifugation. Soluble MHC molecules were purified from the 
cleared culture medium by affinity chromatography on W6/32 antibody 

10 columns at 4 °C. The antibodies were coupled to NHS-activated agarose 
(Pharmacia) or to protein A Sepharose (Sigma) with n-methylpipelimidate 
(Sigma). The columns were washed with 0,5 M NaCl, 20 mM ? Tris pH 8. The 
MHC molecules were eluted from the column with 0,1 M acetic acid at pH 3. 
Peptides were separated from the MHC complexes by boiling for five minutes 

15 in 10 % acetic acid followed by ultra-filtration through a 3 kDa Microcon 
(Amincon) [2). 

Synthetic peptides: 

Peptides were synthesized on AbiMed AMS 422 multiple peptide 
synthesizer (Abimed, Langenfeld, Germany) by Fmoc chemistry, precipitated 
20 with ether and used with or without further purification (HPLC), 
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Peptides separation and analysis: 

The MHC bound peptides were resolved by reverse-phase HPLC on a 
0.1 ID fused silica capillaries with length of about 30 cm (J&W) slurry packed 
with POROS 10 R2 (PerSepetive Biosystems). The capillaries were fitted with 
5 electrospray needle made from 36-gauge stainless tubing (Small Parts Inc, 
Miami Lakes, FL). A Rheodyne 9125 HPLC injector fitted with a 20 loop 
was used for loading the column. The peptides were resolved by a relatively 
long (90 minutes) linear gradient of 5 to 50 % acetonitrile with 0.1 % acetic 
n acid, at a flow rate of about 1 fjl/minute, The flow was electrosprayed directly 

05 10 from the HPLC column into an ion trap mass spectrometer (LCQ, Finnigan). 

m 

|fi The mass spectrometry analysis was done in the positive ion mode, using 

an 

4» repetitively a full MS scan usually between 450 to 1500 atomic mass units 

CO 

^ (amu) followed by collision-induced decomposition (CID) of the dominant ion 

jjfjj selected from the previous MS scan. In some cases the full MS was performed 

S IK?' 

^ is with a narrower mass range to reduce the number of detected peptides. The 

peptides were identified by comparing their MS and CID data to the calculated 
MS and CID of the proteins in the Genpept databank 
(www.ncbi.nlm.nih.gov/genpept) using the Sequest software [25] (obtained 
from Finnigan, San Jose, CA). The number of times each peptide was 
20 fragmented by CID was usually limited to two by dynamic exclusion, a feature 
of the Xcalibur control software the LCQ mass spectrometer (Finnigan), 
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Stabilization of cell surface HLA-2 J by peptide binding: 
RMA-S-HHD cells were washed three times with PBS followed by 
incubation overnight in FCS-free IMDM medium at 26°C. Synthetic peptides 
were added to 10 6 cells at a concentration of 100 uM. The cells were 
5 incubated for two hours at 26°C followed by two hours at 37°C. The 
stabilization of the HHD MHC by the peptides binding was measured by 
FACS analysis on Becton Dickinson FACStar flow cytometer after decorating 
the cells with W6/32 mAb at 4°C for one hour and then 30 roin incubation with 
anti-mouse FITC at 4°C (Sigma). 
io Cytotoxic T lymphocytes assays: 

Transgenic mice expressing a single chain HLA-A2.1/Db-(32m which 
are double knockout for H-2Db and for p2m (HHD mice) [23] were 
immunized four times intra-peritoneally at 7-day interval with 2 x 10^ 
irradiated (5,000 rad) RMA-S-HHD-B7.1 cells loaded for two hours at 26 "C 
is followed by two hours at 37 °C with 100 hM of the synthetic peptides. Ten 
days after the last immunization the spleens were removed from the vaccinated 
mice. Splenocytes were re-stimulated with 100 \M of synthetic peptides for 
five days. Viable lymphocytes were separated by lympholyte-M (Cadarlane, 
Hornby, Canada) and resuspended in RPMI-HEPES. Cytotoxic activity was 
20 measured as in [26] by admixing the lymphocytes at different ratios with 5 x 
10 3 EL4-HHD cells grown in medium containing 35 S methionine and then 
loaded with the synthetic peptides. 
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EXPERIMENTAL RESULTS 
In order to identify large number of MHC bound peptide antigens 
presented in the context of a particular MHC haplotype, different human cell 
lines were transfected with expression vectors for soluble, secreted MHC 
5 molecules. Indeed, different soluble MHC could be transfected into various 
cell lines resulting in enabling the recovery of large amounts of the soluble 
MHC molecules from the cell's growth medium. The sMHC molecules were 
recovered with their authentic patterns of peptides still bound and free of 
contamination by cellular debris and detergents. Prostate (PC3), ovarian 
gj io (UCI-107) and breast (MDA-23 1 and MCF-7) cell lines were transfected with 

the DNA coding HLA-A2.1/Q10b or sMHC-A2 and $MHC-B7. Soluble 
MHC molecules were recovered from the culture medium without disrupting 
the cells and the sMHC molecules were purified by a single step of 
immunoaffinity chromatography. About 200 yg of the sMHC molecules were 
15 recovered from about 10^ cells (Figures 1A-C). The MHC large subunit, the 
(J2m and small amounts of antibodies that were released from the 
immunoaffinity column by the acid treatment were the only proteins detected 
in the column eluant. The peptides were separated from the proteins subunits 
of the MHC by ultra- filtration. The recovered heavy subunit of the soluble 
20 MHC molecules was confirmed to be that of HLA-A2.1 by peptide mapping 
and by micro sequencing. 
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Sequencing of a large number of individual peptides was approached by 
electrospray tandem mass spectrometry. The peptides were partially resolved 
on homemade nano-capillary reversed phase columns interfaced directly to an 
electrospray mass spectrometer. The peptide mixtures were resolved by 
5 relatively long reversed-phase HPLC gradients on long capillary columns, 
enabling performing mass measurements and fragmentation of a large number 
of peptides. The mass spectra were recorded between 450 to 1500 mass units, 
which is the expected mass (m/z) range of the singly and the doubly charged 

r*a MHC bound peptides. The mass spectrometry data included the 

^0 

io total-ion-current chromatogram (TIC, Figure 2A) and the mass spectrum of the 

fji 

|H peptides at each time point (Figure 2B). The mass spectrometer was 

4* programmed to repeatedly select the most abundant peptide observed in each 

^ spectrum and to fragment it by CID (Figure 2C). Peptides were identified by 

U 

i n 

pj comparing their masses and the masses of their fragments to those calculated 

yQ 

p 15 for peptides derived from all the human proteins in the databank. The 

computer programs were instructed to search for putative peptides resulting 
from non-specific proteolysis since the specificity of proteases responsible for 
generating the MHC bound peptides in cells is not well defined. 

The relatively high sensitivity of the capillary ESI-MS/MS analysis and 
20 the large amounts of peptides recovered from the cells by use of the soluble 
MHC, enabled to perform multiple capillary HPLC separations with each 
peptide preparation. Peptides recovered from soluble MHC produced by about 
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5x10? cells were used for each capillary chromatography. Multiple 
chromatography runs enabled to detect those peptides that were observed 
reproducibly and to combine their CID data to improve the signal-to-noise 
ratio of the CID spectra. The combined and improved data sets were used for 
5 databank searches and peptide identifications. Using relatively long capillary 
columns (of above 30 cm) and long reversed phase gradients facilitated 
achieving high resolving power. Most peptides elute normally during 15 to 30 
seconds, which was a sufficient time for the mass spectrometer to analyze up 
q to three different co-eluting peptides. The mass spectrometer was 

(55 10 programmed not to fragment any peptide more than twice in order to increase 

ffS 

IT the total number of peptides analyzed during each chromatography. 

Ill 

*f A total of about three thousands different peptides were sufficiently 

CO 

resolved and fragmented during the different chromatography runs of the 

y 

% I mixtures eluted from the sHLA-A2 and SHLA-B7 recovered from the different 

iO 

f% 15 cell lines. The large majority of the observed peptides were common to all the 

different cancer lines and only a small fraction was detected in only one of the 
cancer types. From this large number of detected peptides, about 200 were 
identified at high certainty to be derived from known proteins and the rest 
were not identified. Table 8 is typical list of such peptides recovered from the 
20 soluble MHCs and identified by the computer analysis. Among these peptides, 
fourteen were already known as MHC bound peptides. Those desired peptides 
that originate from putative tumor antigens were chemically synthesized to 
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further evaluate the accuracy of their amino acid sequences and to enable to 
study them as MHC bound peptides and their significance as cancer antigens* 
Their amino acid sequence accuracy was ascertained by running a 
chromatography of the synthetic peptides using the exact conditions 

5 immediately after the natural peptides mixtures and comparing the 
chromatography retention times, the exact ma$$e$ and the CID spectra of the 
synthetic and natural peptides (Figures 3A-B). When synthetic peptides 
behaved identically to the natural peptides in these three criteria served as a 
clear indication that the identification was indeed correct. Twenty-seven of the 

io most interesting peptides were chemically synthesized and confirmed to be 
correct by this assay, example of which is displayed in Figures 3A-B. 

Table 8 

List of peptides recovered from sMHC of different cancer cells and identified 
1 5 at high certainty by mass spectrometry 



Peptides from soluble HLA-A2 





Mass 
(miz) 


Sequence 
(SEOIDNO:) 


Protein 


Position 1 


Score 2 


Score 3 


Synthetic 4 


fret* 


I 


898.4 


LLDVPTAAV(l) 


y IFN inducible protein (1P-30) 


17-25 


159.9 


28 




[41] 


2 


1011.5 


LLLDVPTAAV(2) 


Y TFN inducible protein (I.P-30) 


16-25 


1793.7 


31 




[41] 


3 


1210.4 


LtUWPTAAVQA(3) 


Y If N inducible protein (1P-30) 


16-27 


128.1 


21 




141) 


4 


800 5 


GLLGTLVQ(4) 


Beta catenin 


400-407 


0.2 


17 


+ 




5 


913.4 


CLLGTLVQL(S) 


Beia eaten in 


400-408 


181.7 


3t 






6 


922.3 


AJ-FGAtFLA(6) 


Phospholipid transfer protein 


2-10 


245.2 


23 


+ 




7 


945.4 


SULGGDWSV(7) 


TSC-22-like protein 


22-32 


591.9 


34 






8 


947.4 


NLTr$DV$V(S) 


MUCI 


130-138 


69.6 


23 


+ 


(26) 


9 


958.3 


SiWQQPAEA(9) 


Human collagen type J V 


18-25 


41,2 


23 


+ 




10 


98 J .7 


SLIGHLQT100) 


protein tyrosine posphatese 


336-344 


49.1 


32 






I] 


m,$ 


SLSEKTV1X0J) 


CD59 


106-U4 


87.6 


29 


+ 




12 


989,4 


$UPCKLEV(12) 


Flightless I horoolog 


1010-18 


257,3 


30 


+ 




M3 


1028.5 


GL1EKNJEL03) 


DNA methyl transferase (MTDM) 


425^433 


87.6 


28 


+ 




14 


1031.4 


GLYPGUWU14) 


Interferon regulatory factor-6 


21-29 


864.8 


30 






15 


1038.5 


YLLPA1VHI0$) 


RMA heticase 


146-154 


408.4 


30 




PI 


16 


1068.4 


ALSDHHIYL(l6) 


Fructose bis phosphate aldolase 


216-224 


481.7 


23 


+ 




17 


1071.5 


TLDQKINEV07) 


ornithine decarboxylase 


23-31 


108.8 


30 




[96] 


18 


1071,6 


ILOlOCVEKV(l8) 


Human HSP 90 betfuHSP 84 


570-578 


53.3 


29 




[74] 


19 


1080.4 


SLLPPTALVGU19) 


H. Transporter SEC23A 


156-164 


181.8 


33 






20 


1091.4 


CVYDGEEHSV(20) 


MAGE-B2 


231-240 


799 


20 
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2t 1094.5 


SaPPDALVGL(21) 


K Transporter SEC23B 


15(M6Q 


181.8 


33 






22 1121 J 


TLWVDPY£V(22) 


B- cell translocation gene (BTGl) 


103-111 


577,3 


24 


+ 




25 1145.4 


FLFDGSPTYV(23) 


Fatly acid synthase (FAS) 


292-301 


26694 


23 


+ 




24 1258 5 


FLPDCSPTYVL{24) 


Patty acid synthase (FAS) 


292-302 


611.2 


27 






25 1360,4 


ALWDIETGOOTVa5) 


guanine nucleoli de-binding 


167-178 


2366.8 


28 


+ 




Peptides from soluble HLA-B7 


Mass 


Sequence 


Protein 


Position 1 


Score 2 


Score 3 


Synthetic 4 


Rcf 5 


(mlz) 


(SEQID NO:) 














1 854.3 


VP$ePCGVL(26) 


70 kDaSHP-lL 


422-30 


120 


27 


+ 




2 883.4 


SPTQPlQC(27) 


cell membrane glycoprotein 


257-61 


80 


20 










1 10000 Mr 












3 895.4 


SPALPCLKL(28) 


transmembrane activator and 


147-55 


120 


27 


+ 








CAMt interactor 












4 899.5 


APRTVALTA(29) 


HLA-SB beta 


9-17 


60 


24 




t?5] 


5 927.3 


SPKLPVSSipO) 


DNA binding protein homolog 


372-80 


120 


25 


+ 




6 989.3 


KPSLPFTSL(31) 


translation initiation codon 


79-87 


120 


28 


+ 




7 m.s 


LVMAPRTVU32) 


MHC class-! 


2-10 


135 


18 




(75) 


8 1050.4 


KPAfPAEKL(33) 


annexin A I 


274-82 


80 


22 






9 1075.4 


SPYQNIKJU34) 


spermidine ami nopropyi transferase 


128-36 


80 


20 






10 1104.5 


AASKERSGVSL(3$) 


Historic HI 


50-60 


36 


18 




(751 


11 1114.3 


APF6PLASGIL(36) 


precursor 


2-12 


240 


22 


+ 




12 M94.5 


APSOSLA VPLAV1/3 7) 


hypothetical orotctn 


0-21 


360 


31 







Table 8: An example of MHC bound peptides that were identified by the 
5 Sequest software [25] (obtained from Firmigan, San Jose, CA) after mass 
spectrometer analysis. ^Position of the first and the last amino acid of the 
peptide. ^Calculated score, estimating half the time for dissociation of the 
peptide-MHC complex (42). 3 Calculate score. 4 sequence approved by 
analyzing in comparison a synthetic peptide. 5 Peptide is known. 

10 Among the many peptides derived from different housekeeping proteins 

and enzymes, some peptides were determined to be derived from known tumor 
associated antigens. These include mucin (MUC-1) and MAGE-B2 while 
others were derived from proteins whose level is known to be significantly 
elevated in cancer cells such as beta-catenin, DNA methyl transferase and fatty 

15 acid synthase (Table 8), 

A comparison in the patterns of peptides presented by the same MHC in 
cell lines of different tissue origin enabled the identification of those peptide 
uniquely presented in only cells of a particular tissue origin. Only a few of the 
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peptides were determined to be unique to specific cell lines while most of the 
peptides that were observed in all the different cell lines were derived from 
normal cellular proteins. Also, significantly different patterns of peptides were 
recovered from sHLA-A2 and from sHLA-B7. Examples for unique peptides 
5 which are displayed in Table 9 include peptide p922 (phospholipid transfer 
protein) recovered only from PC3 cells and peptide p947 (SEQ ID NO;8) 
(MUC1) recovered only from MCF-7 grown without estrogen. Peptide p945 
(SEQ ID NO:7, derived from TSC-22-like protein) is a novel peptide that was 
y detected at high level in the two-breast cancer cells (MCF-7 and MDA-231), 

^ 10 but was not observed in the ovarian (UCI-107) and the prostate (PC3) cancer 

HI 

|ji cells, Peptide p981 (SEQ ID NO:10) originated from protein tyrosine 

fS phosphatase, and was detected only in the breast cancer cell MDA-231. One 

0 of the most interesting novel peptides identified was pi 091 (SEQ ID NO:20) 

01 

? ; derived from the tumor antigen MAGE-B2, The peptide was detected only in 

J*f 15 the ovarian cancer cells (UCI-107) and not in the other cell lines. The 

synthetic and natural peptides elution pattern and CID spectra of both were 
identical (Figures 4A and 4B). The binding affinity of this peptide to the 
MHC molecules was determined to be normal as assayed by reconstitution and 
stabilization of empty MHC on the surface of RMA-S-HHD cells (Figure 4C). 
20 This peptide is derived from the same region in the MAGE proteins, as do 
other previously identified MHC bound peptides derived from MAGE-A4 and 
from MAGE-A10 [27, 28] (Figure 4D). 
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Comparison of MHC peptide patterns between cell lines of different cancer 

origin 





Mass 


Sequence 


MCF-7 


1 MCF-7 


MDA^ 


PC-3 


UCI- 


ucr- 




(m&) 


(SEQtDNO;) 




without 


231 




107 


101 










estrogen 










1 


898.4 


UOVPTAAV(l) 






+ 




+ 


+ 


2 


1011.5 


LLLDVPTAAV(2) 


+ 




+ 


_ 


T 


+ 


3 


1210.5 


UXDVPTAAVQA(3) 


+ 


+ 






4- 


+ 


4 


£00.5 


GllCTLVQ(4) 


m 


m 




_ 


+ 




5 


913.4 


(5LLGTLVQL(5) 


+ 


+ 


+ 


+ 


+ 


4- 


6 


922.3 


ALFGALFLA(6) 


_ 






+ 






1 


945.4 


$LLGGDWSV(7) 




+ 


4* 


_ 


- 


- 


8 


947.4 


NLT1SDVSV(8) 


_ 


+ 




_ 








958.3 


$LWGQPAEA(9) 


4* 






■f 




+ 


10 


981.7 


SLIOHLQTL{10) 






+ 








11 


989.5 


SLSfilCTVLL(n) 










+ 


+ 


12 


989.4 


SLFPQKL£V(\2) 






+ 


4- 


+ 


4- 


>13 


1028.5 


GUEKMEL(13) 








+ 


+ 


+ 


14 


1031.4 


GLY?QUWL(14) 


+ 


4* 


4- 


4- 




+ 


15 


) 038.5 


YLL?A)VHI(I5) 


+ 


+ 


+ 


+ 


T 


T 


16 


1068.4 


ALSDHHJYL(16) 


+ 




4* 








17 


1071.5 


TLDQKINEV(17) 




+ 


+ 


4- 


+ 




18 


1071.6 


ILDwCVtiKV(lc) 






+ 








19 


1080.4 


01 T DDT AT VAT fiQ\ 










4* 


-J- 


20 


1091.4 


GVYIXiHfiH!SV(20) 














21 


1094.4 


SLLP?DALVuL(2 1 ) 


+ 


4- 


4- 








22 


1121,3 


fLWVDPYBV(22) 


+ 


+ 


4" 


4" 




+ 


23 


1145.4 


FLFDGS FTYV(23) 














24 


U58.5 


FLfDGSPTYVL(24) 


+ 


+ 


+ 






4. 


25 


1360.4 


ALWDlhlGQQTV (2 S ) 














(B) 


















Mass 


Sequence 


cm 


MDA-2 


UCl-10 










(mfc) 


(SEOIDNO:) 




31 


7 








i 


854.3 


VPSEPGGVL(26) 














2 


883.4 


SPTQHQU27) 




+ 










3 


895.4 


SPALPGLKL(28) 


+ 












4 


899.4 


APRTVALTA(29) 


+ 












5 


999.5 


SPKLPVSSL(30) 


+ 


t 


+ 










927,3 


<PStPPTSt(30 






+ 








7 


989.3 


LVMAPRTVL(32) 














9 


1050.4 


KPAFFAEKL(33) 






+ 








9 


1075.4 


SPYQWIKJl(34) 














10 


1104.5 


AASK£RSGV$L(3 5) 






+ 








11 


1114,3 


AfFG PtASGTL(36) 






+ 








J2 


1194.5 


APSGSLAVPIAVU37) 




+ 











10 



cancer cells MCF-7 and MDA-231, 1 MCF-7 that grown without estrogen, 
prostate cancer cell PC-3 and the ovarian cancer cells UCI-107 and UCM01. 
(B) The appearance of peptides from soluble HLA-B7 in B cell leukemia 
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cancer cells C1R, breast cancer cells MDA-231 and ovarian cancer cells 
UCM07. 

Another approach to ascertain that the identified peptides were indeed 
MHC bound peptide antigens, their capacity to bind tightly and stabilize cell 
5 surface HLA-A2.1 was tested by reconstitution into empty MHC on the 
surface of RMA-S-HHD cells. Binding was assayed by FACS analysis after 
decorating the cells with the fluorescent anti-intact MHC mAb W6/32 (Figure 
5). Nine of the synthetic peptides were determined to stabilize cell surface 
MHC significantly more than without the added peptides and to a similar 
q 10 extent as peptide (G9-209-2M) IMDQVPFSV (SEQ ID NO;42) 9 derived from 

08 the melanoma protein gp- 1 00 [29] . 

U! To further evaluate the affinity of the peptides to the HLA-A2 and to 

111 

as 

+: obtain some insight into their immunogenic potential, selected peptides were 

L tested for their ability to induce an immune response in HLA-A2 transgenic 

nj 15 mice. It was assumed that only peptides that could be effectively presented 

q and remain tightly bound to the cells would be capable of inducing an immune 

response in these mice. The same synthetic peptides that were used for the 
FACS analysis were used for immunization of the HHD transgenic mice, 
which express the human HLA-A2.1/Db- (J2m single chain. To immunize the 
20 mice, the HHD culture ceils were loaded with the different peptides and then 
injected to the HHD mice. The immune response in the mice was followed by 
the appearance of CTLs specific for these peptides. The lysis patterns of the 
target HHD cells by T-cells taken from the immunized mice are shown in 
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Figure 6. Some of the peptides were indeed capable of inducing an immune 
response, which both authenticate them as MHC bound peptides and gives an 
indication about their immunogenic potential. The CTL results demonstrate 
significant lysis of EL4-HHD cells loaded with the peptides pl028 (SEQ ID 
5 NO:13) from DNA methyl transferase, pl258 (SEQ ID NO:24) from fatty acid 
synthase, pi 121 (SEQ ID NO:22) from B cell translocation gene (BTG) and 
pi 068 (SEQ ID NO: 16) from aldolase as compared to the negative control 



peptide ALLCAPSLL (SEQ ID NO;43). 



SUMMARY OF PEPTIDE INFORMATION FOR SOLUBLE HLA-A2 



In 
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m 



II! 



10 



is 



20 



25 



30 



35 



The following provides a summary of peptide information so far 
collected for eptides bound to soluble HLA-A2 using the method of the 
present invention. 

The following notations are used herein: 



<3: 
Mg: 
Mp: 
Tg: 
Tp: 
S: 
A2: 
B7: 
Pi 
PR; 
PCS: 

genpept : 
ref : 

Cell lines: 
#D; 
#E: 
#F: 
#EST: 
#FR: 



group number until May 7, 2001 

mass of the natural peptide 

mass of the identified peptide 

observed retention time of the natural peptide 

calculated retention time of the identified peptide 

calculated internal score 

adherence to HLA-A2 consensus motif 

adherence to HLA.-B7 consensus motif 

identified peptide sequence 

protein from which sequence is derived 

location of peptide in protein 

link to protein information in GenBank 

previously known peptide 



#H 
#1 



PC3+A2/Q10 

MCF7+A2/Q10 

MDA-231+A2/Q10 

MCF7(with estrogen) +A2/Q10 

MCF7 (without estrogen) +A2/Q10 

UCI-107+A2/Q10 

C1R+SB7 

UCI-107+SB7 
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MDA-231+SB7 
UCI-101+S&2 

2180 (ovarian cancer cell line)+sA2 
synthetic peptides 

Mg- 800.4: #£+(2,1) #G+ ( 10, 7 ) 5=83 { 87 , 74 ) Mg° 800.5 Tg-38+-0 
Mp=600.5 A2= 0.02/18 F-GLLGTLVQ 

PR=*>gi| 860988 |emblCAA6H07,l I (X87838) beta-catenin 

[Homo sapiens] POS-399 fSEQ ID NO: 4) 

G-1234: Mg= 810.3: Tg-31+-1 #D+{2,2) #E+(7,4) #F+(4,3) #EST+(7,2> 
#FR+{4,2) #G+(33,11) #K+(3,2) #L+(2,1) 

5=84(87, 79) Mp= 810.2 (-0,1) Tp= 34 A2= 11/29 P=>ALAPGLPTA 
genpept PR=>gi j 5771535 I gb| AADS1419. 1 1 AF173937 J. (AF173937) secreted 
protein of unknown function [Homo sapiens] POS«21 (SEQ ID NO: 44) 



#L 
#S 



G=1990 
Tp= 53 
genpept 



20 



G=l251: Hg= 811.4: Tg-35+-0 #G+(5,3) 

S=96(96,99) Mp=» 811. 5( 0.1) Tp= 36 A2=465/26 P-KLLEPVL 

genpept PR=>gi I 338447 | gb I AAA60583 . 1 1 (M60854) RPS16 [Homo sapiens] 

POS=50 (SEQ ID NO: 45) 



o 

. Pi 
IB 

01 

3 R 
Us 



G-=1378: Mg= 841.3: Tg«41+-0 #G+(9,5) 

S~77(83,66) Mp<= 841.4 ( 0.1) Tp* 40 A2=0.0/16 P°SLLPAIVE 
genpept PR=>gi 1 189428 I gbj AAA36399. 1 1 (J02902) phosphatase 2A 
25 regulatory subunit [Homo sapiens] POS=403 (SEQ ID NO: 46) 

G=14l9: Mg= 848.3: Tg~34+-i #E+(2,1) #F+(2,1] #G+(5,5) #K+(l r l) 
5^84(83,89) Mp= 848.4 ( 0.1) Tp= 34 A2= 52/26 P°SVLGSLSSV 
genpept PR=>gi I 5833114 | gb IAAD53401 . 1 |AF107840_1 (AF107840) nuclear 
30 pore-associated protein [Homo sapiens) POSo280 (SEQ ID NO: 47) 



Iff 



12 



G=1420: Mg= 848.4: Tg=37+-1 #D+{2,2) #E+(U,6] #F+{8,5) #EST+{3,2) 
#FR+(4,2) #G+(25,11) 

S=95{94,99) Mp- 848. 4 ( 0,0) Tp= 39 A2-118/28 P=LLGPPPVGV 
35 genpept PR=>gi 1 10436199 1 dbj I BAB14750 . 1 1 (AK023978) unnamed protein 
product [Homo sapiens] POS=159 (SEQ ID NO; 48) 

(5=1439: Mg= 852,3: Tg=22+-2 #FR+(2,1) #1+ (32, 6) 

3=83(81,89) Mp= 852.0 (-0.3) Tp=- 25 A2-0.0/1 F-PGPCPPrpfr 
40 genpept PR=>gi | 5689367 I dbj I BAA82967 . 1 1 (AB021227) membrane-type-5 
matrix metalloproteinaae [Homo sapiens] POS=ll (SEQ ID NO: 49) 



45 



G«1492: Mg= 860.3: Tg=37+-I #G+(9,5) 

S=81l87,69) Mp= 860.3 ( 0.0) Tp« 31 A2=116/28 P=SM3GPLIGV 
genpept PR=>gi 1 1469189 I dbj i BAA09482 . 1 1 (D50923) The KIAA0133 gene 
product is novel. [Homo sapiens] POS t =629 (SEQ ID NO: 50) 



G=1510: Mg^ 862.5: Tg=>29+-2 #F+(4,4) ttG+ (4 r 2) 
S=78(87,59) Mp= 862.2(-0.3) Tp= 27 A2=116/32 
50 genpept PR=>gi 1 12484559 1 gb I AAF20366 . 2 I AF150754_1 

3'phosphoadeno3ine 5 ' -phosphosulfate synthase 2b 
sapiens] POS=542 {SEQ ID NO:51} 



P=SMAPGLT$V 
(AF150754) 
isoform [Homo 



55 



G=1540: Mg« 
#K+{3,3) 



868.4: Tg«43+-0 #E+(2,1) #F+(7,4) #FR+(1,1) #G+(4,4) 
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S=83{89,69) Mp= 868. 4 ( 0.0) Tp- 46 A2- 19/30 P=LLIPGLATA 
genpept PR~>gi| 227 4974 I emb | CAA57489 . 1 1 (X81900) NADH oxidoreductase 
subunit MWFE [Homo sapiens] POS=16 (SEQ ID NO: 52) 

G=1563; Mg- 871,3: Tg=36+-1 #D+U,D #E+(8,6) #F+(4,3) #EST+(3,2) 
#FR+(5,2) #G+(15,9) 

S=86 (89, 79) Mp<= 871.4 ( 0.1) Tp= 33 A2=592/33 P=GLLGNVAEV 
genpept FR*>gi 1 12655181 1 gblAAH01447 . 1 1 AAH01447 (BC001447) Similar to 
ZYG homolog [Homo sapiens] FOS=l0 {SEQ ID TO: 53) 



10 



GcO.575: Mgo 872.4: Tgo32+-3 #D+(2,1) #E+(3,3) #F+(24,7) #EST+(3,2) 
ttFR+{4,2) #G+(12,7) 

S=79(76,88] Mp= 872, 5 ( 0.1) Tp= 33 A2= 11/26 P-SLIKLVEA 
genpept PR=>gi I 7020538 j dbj I BAA91170. 1 1 (AK000444) unnamed protein 
15 product [Homo sapiens) POS°277 (SEQ ID NO: 54) 

G=1606; Mg= 876.4: Tg«28+-2 #E+(2,1) #F+(2,1) #EST+(1,1) #FR+(4,2) 
S=84(82,89) Mp= 876.3 (-0,1) Tp= 32 A2=201/3l P=GLAESVSTL 
genpept PR~>gi 1 126527 33 1 gb | AAH00116 . 1 I AAR00116 (BC000116) Similar to 
20 KIAA0174 gen© product [Homo sapiens] POS=95 (SEQ rD NO: 55) 

6=1621: Mg* 878.3: Tg=40+-1 #£+(6,4) #F+(4,2) #EST+(l f l) #FR+{3,2) 
*G+{11,8) #K+(1 # 1) 

0 S=88(92,79) Mp<=> 878.4 ( 0.1) Tp« 40 A2= 55/30 P-AIIGGTFTV 

5 25 genpept PR=>gi 1 6330243 I dbj I BAAS 64 95 . 1 1 (AB033QQ7) KIAA1181 protein 



[Homo sapiens] POS=304 (SEQ ID NO: 56} 



G=1637; Mg«> 880.3: Tg=37+-0 #H+(5,3) 



U! 3-88(89,88) Mp= 880. 4 { 0.1) Tp= 40 A2= 2/23 P-IITGPAPVL 

«p 30 genpept PR°>gi ) 7542357 1 gb I AAF63417 . 1 |AF142422_1 (AF142422) QUAKING 

pg isoform 3 [Homo sapiens] POS=250 (SEQ ID NO: 57) 

% G=1655: Mg= 882.3: Tg^43+-1 #F+(1,1) #G*(6,4) #K+{1,1) 

S~87{87,88) Mp= 882. 3( 0.0) Tp= 43 A2=0.0/18 P=SFDGWATV 
Hj 35 genpept PR=>gi | 726394 4 1 emb I CAB81773 . 1 1 (AJ276359) mucin 4 [Homo 

HJ sapiens] POS=1560 (SEQ ID NO: 58) 

JS3, 

y G=1732i Mg= 894.4: Tg=42+-0 #G+(6,3) 

y, S^79(82,72) Mp- 894. 4 ( 0.0) Tp- 41 A2= 2/20 P-LPPDALVGL 

40 crenpept PR°>gi 11296666! emb I CAA65775.il (X97065) Sec23 protein [Homo 
sapiens] POS=158 (SEQ ID NO: 59) 

G=1737: Mg= 895.3: Tg°14+-1 #G+(4,2) 

3=76(79,72) Mp= 895. 4 ( 0,1) Tp= 22 A2 = 47/25 P-ILDAGGHNV 
45 genpept PR«>gi J 1808578 | dbj I BAA07 918 . 1 1 (D44466) protea3ome subunit 
pll2 [Homo sapiens] POS°736 (SEQ ID NO: 60) 

G=1744: Mg= 896.3: Tg=28+-3 #D+(6,3) #E+(14 f 5) tfF+(20,6) 
#EST+(5,2) #FR+(6,3) #G+(47,11] #K+(8,4) 
50 S°92(98,81) Mp= 896. 4 ( 0.1) Tp= 30 A2-512/30 PoGLYSGVTTV 
genpept PR^>gi | 36065 I emb I CAA42118 . 1 1 (X59543) Ml subunit of 
ribonucleotide reductase [Homo sapiens] POS=4 6 (SEQ ID NO: 61) 

G°1745: Mg- 896.3: Tg=55+-0 #G+(B,4) 

55 5-79(76,86) Mp= 896. 5( 0.2) Tp= 60 A2=>lK/24 P-FLYPFPL 

genpept PR=>gi | 436224 | dbj |BAA05062 . 1 ( (D26067) KIAA0033 [Homo 
sapiens] POS=185 (SEQ ID NO: 62) 



29-MAY-2001 21:13 FROM: 



TO: 972 9 7706333 P. 075 



74 

G-1768: Mg= 898.4: Tg=36+-1 #0+111,3) #E+(50,10) #F+(13,7) 
#EST+(7,2) #FR+(13,3) #G+(63,11) #K+{15,6) #L+(9,3) 

S-81(78,89) Mp= 898.4 ( 0.0} Tp*> 36 A2= 47/28 P=LLDVPTAAV 
5 genpept PR<=>gi | 6165618 1 gbl AAFO « 61 8 . 1 1 AFO 9 7 3 6 2JL < AFO 97362) 

gamma-inter feron. inducible lysosomal thiol reductase [Homo 9apiens] 
POS=27 (SEQ ID NO:l) ref 

G=l?70; Mg= 898.4: Tg=38+-1 #E+(4 # 2) #F+(3,2) #FR+(1,1) #G+(18,11) 
10 #K+(2,1) 

S=8S(92,79) Mp^ 898. 3 (-0,1) Tp= 41 A2- 79/29 P-ALLPSSPTL 
genpept FR=>gi 1 1737205 I gb|AAB38876. 1 1 (U75276) TFIIB related factor 
hBRF (Homo sapiens] FOS=609 (SEQ ID NO: 63) 

15 G-1786: Mg=* 899.5: Tg=26+-2 #F+<2,2) #EST+(4 # 2) #FR+(5,2) 
#G+(32,8) #K+(8,5) #L+(1,1) 

S=93 (96, 89) Mp= 899.51 0.0) Tp= 27 A2=*243/25 F-KLGSVPVTV 
genpept PR=>gi 1 12653653 | gbl AAH00609. 1 1 AAH00609 (BC000609) KIAA0738 
gene product [Homo sapiens] POS=623 (SEQ ID NO: 64) 

20 

G-1795: Mg° 900.4: Tg=*53+-0 #D+(2,2) #E+(7,5) #F+(11, 6) #EST+(2,1) 

#FR+(6,3) #G+(2lrll) #K+(16,6) 
y 5-81(66,72) Mp^ 900, 5 ( 0.1) Tp= 55 A2=182/33 PoALFPGVALL 

genpept PR=>gi I 2245365 |gb I AAC51518 . 1 1 (075885) ER-60 protein [Homo 
|h 25 sapiens] POS<=7 (SEQ ID NO; 65) 

01 

fn G-1802: Mg° 901.3: Tg=33+-2 #E+(3,2) #G+(2,1) 

J>j 5*85(92,69) Mp° 901.51 0.2) Tp= 32 A2=160/29 P=GLVGSLQ£V 

genpept PR->gi | 11967711 |emb iCAC194 84 . 1 | (AJ278357) Tsg24 protein 
30 [Homo sapiens] POS-56 (SEQ ID NO: 66) 



IS 



s G*1804: Mg- 901.4: Tg=31+-2 #E+(1,1) #FR+(1,1) #G+<6,3) 

O S=90{95,79) Mp= 901.3(-0.1) Tp^ 22 A2= 2/18 P=APLSDTAQV 

Ifl genpept PR=>gi 1 10438789 ! dbj | BAB15344 . 1 1 (AK026063) unnamed protein 

35 product [Homo sapiens) POS=197 (SEQ ID NO: 67) 

!g G=1804: Mg- 901.4: Tg=31+-2 #£+(1,1) #FR+(1,1) #G+(6,3) 

S=89{94,79) Mp=> 901. 5( 0,1) Tp= 36 A2=160/33 P-SLASLLAKV 

H genpept PR=>gi 1 8489831 1 gb| AAF75772. 1 IAF265555JL (AF265555) 

40 ubiquit in-conjugating BIR-domain enzyme APOLLON* [Homo sapiens) 
POS*=1230 (SEQ ID NO: 68) 

G-1822: Mg- 903.3: Tg=16+-7 #D+{7,3) #E+(14,4) #F+(14,4) 
#EST+(18,2) tFR+(13,3) #G+(U6,10) #K+(20,6) 
45 S~92 (98,79) Mp» 903.4 ( 0.1) Tp- 23 A2-160/29 P-GLATDVQTV 
genpept PR=>gi 1565647 |dbj I BAA05645 . 1 1 (D26598) proteasome subunit 
HsClO-II [Homo sapiens] POS=55 (SEQ ID NO: 69) 

G*=1860: Mg» 907.5: Tg=39+-1 ftD+{12,4) #£+(1,1) #F+(9,5) #£ST+(3,1> 
50 #FR+(4,2) #G+(10,6) #K+(4,3) 

5-88(91,81) Mp* 907. 5( 0.0) Tp° 37 A2- 79/31 P=SLFGGSVKL 
genpept PR=>gi 1 13375569 i gb | AAK20398 . 1 1 AF349951 JL (AF34 9951) HF95 
[Homo sapiens] POS<=103 (SEQ ID NO: 70) 

55 G-1861: Mg= 907.6: Tg-38+-l #EST+(2 # 1J #FR+(1,1> 
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S=87(87,89) Mp° 907 . 6 ( 0.0) Tp« 32 A2*» 21/19 P«KVGPVPVLV 
qenpept PR=>gi 1 12804 623 1 gb I AAH01734 , 1 | AAH01734 (BC001734) protein 
translocation complex beta [Homo sapiens] POS=67 (SEQ ID NO;71) 

G°1899; Mg° 910.3: Tg=46+-1 #E+(8,5) #F+(21 r 7) #EST+(4,2) 
#FR+(7,3) #G+(24,11) #K+(13,6) #L+(3,2) 

S=72(65,89) Mp= 910. 4{ 0.1) Tp= 40 A2=l82/32 P=GLLPDVPSL 
qenpept PR=>gi 1 13623421 1 gb I AAH06309 . 1 [ AAH06309 (BC006309) Similar to 
RIKEN cDNA 5730589L02 gene [Homo sapiens] POS=141 (SEQ ID NO: 72) 



10 



G=1901: Mg= 910.4: Tg=39+-0 #D+(2,1) #E+(5,2) #F+(9,5) #EST+(1,1) 
#FR+(1, 1) 

3=80(90,59) Mp= 910.4 ( 0*0) Tp« 41 A2=*160/30 P-ALPPVLTTV 
qenpept PR=>gi | 38821 B3 | dbj | BAA34 451 , 1 1 (AB018274) KIAA0731 protein 
15 [Homo sapiens] POS=l31 (SEQ ID NQ:73) 

G°1904: Mg= 910.4: Tg=38+-1 #E+(2,1) ttF+(3,2) #EST+(1,1) #FR+(3,2) 
#G+(3,3) #K+(2,1) 

S°90(95,79) Mp= 910. S{ 0,1) Tp° 32 A2= 52/24 P-GVLFNIQAV 
20 qenpept pR«>gi | 72 64004 | emb | CAB81656 . 1 1 (AL049822) dJl60A22>4 
(histone H2A) [Homo sapiens] POS^107 (SEQ ID NO: 74) 

^ G=1922: Mg= 912.5: Tg=42+-1 #E+(5,4) #F+(2,1) #FR+(1,1) #G+(3,2) 

^ #K+(1,1) 

£j 25 S=78 (S3, 69) M P ° 912. 5( 0.0) Tp= 43 A2- 49/31 P=A1TPVWTL 

§1 genpept PR=>gi 1 13177739 I gb I AAH0364 4 . 1 1 AAH0364 4 (BC00364 4} 

|]| cyclin-dependent kinase 4 [Homo sapiens] ' POS=170 (SEQ ID NO: 75) 

1~ G=1931: Mg= 913.4: Tg=34+-1 #E+(8,4) #F+(2,2) #£ST+<3,2) #FR+(1,1) 

% 30 S~84(96 ( 59) Mp= 9l3.3(-0.1) fp- 29 A2= 70/27 P=ALNPADITV 
W qenpept PR=>gi I 6634421 1 emb I CAB64373 . 1 1 (AJ238375) putative protein 

* TH1 [Homo sapiens] POS=103 (SEQ ID NO: 76) 

if 55 ! 

|Jj G°1933: Mg= 913.4: Tg-49+-0 #S+<12,2) #D+(17,5) #E+(16,8) 

Sj 35 #F+(18,7) #EST+<2,1) #FR+(4,2) #G+(22,11) #H+(1,1) #K+<10,6) 

^ 3=93(95,89} Mp= 913. 6 ( 0.2) Tp= 4 6 A2-182/31 P°GLLGTLVQL 

^ qenpept PR*>gi 1 38520 I embl CAA7 94 97 . 1 1 (Z19054) beta catenin [Homo 

W sapiens] POS=400 (SEQ ID NO: 5) 

40 G=1939: Mg= 914.4: Tg=*40+-0 #G+(4,3) 

S=82(79, 89) Mp° 914. 4( 0.0) Tp= 42 A2-0.0/16 P-DAEGLALLL 

qenpept PR=>gi 1 1060907 | dbj I BAA11242 . 1 1 (D78177) quinolinate 

phosphoribosyl transferase [Homo sapiens] POS=2 (SEQ ID NO: 77) 

45 G=1942: Mg° 914.5: Tg=16+-3 #F+(4,1) #£+(6,3) 

5=90(95, 79) Mp- 914.4 (-0.1) Tp- 27 A2=160/29 P-SLTGHISTV 

qenpept PR=>gi | 28322 96 1 gb I AAD094 07 . 1 | (AF044333) pleiotropic 
regulator 1 [Homo sapiens) POS=241 (SEQ ID NO:78) 

50 G=1948: Mg= 915.5: Tg=38+-0 ttD+(2,l) #F+(11,7) 

3=91(97,77) Mp= 915. 6( 0.1) Xp= 42 A2-0.5/15 P=VHVLTFTV 
qenpe_pt PR=>gi I 3242214 I emb i CAA07243 . 1 1 (AJ006778) DRIM protein [Homo 
sapiens] POS=1896 (SEQ ID NO: 79) 



55 



G=1974: Mg~ 918.3: Tg=36+-1 #F+l3,2) #G+{19,10) 
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5=84(88, 77) Mp= 918. 6< 0-3) Tp=* 34 A2= 6/25 P^SLKYVPLV 
genpept pR=>gi 1 10436278 I dbj | BAB14783* 1 1 (AK024024) unnamed protein 
product [Hojtio sapiens] P0S°248 (SEQ ID NO:70) 

G=l979: Mg= 918.6: Tg=53+-0 #E+(5,3) #F+(6,3) #£ST+(1,1) #FR+<4,2) 
#G+(4,2) #K+(4,3) 

S«81£84,74) Mp<= 918.5(-0.1) Tp= 54 A2^0.8/16 P-LPYWGVAL 
genpept PR=>gi I 7023639 | dbj | BAA92Q35 . 1 1 (AK002014) unnamed protein 
product [Homo sapiens] POS=272 {SEQ ID NO:71) 



10 



G=1988: Mg= 920.3: Tg=32+-1 #£+{2, 2) #F+(S,2) #FR+{1,1) #6+05,11) 
#K+<1,1) 

S«89{90,89) Mp= 920-3 ( 0.0) Tp= 27 A2= 31/24 P=SIYPSPTGV 
genpept PR=>gi i 3661610 [ gb | AAC61776 . 1 1 (AF092565) splicing factor 
15 Prp8 [Homo sapiens] P0S=1693 (SEQ ID NO:72) 

G=2008: Mg= 922.3: Tg=58+-1 #S+{9,1) #D+(17,5) #L+{8,2) 
S-79(80,77) Mp= 922, 5 ( 0.3) Tp« 59 A2=245/22 P«ALFGALFLA 
genpept PR=>gi i 2653432 | dbj I BAA23647 , 1 1 (AB005297) BAI 1 [Homo 
20 sapiens] P0S-1163 (SEQ ID NO: 6) 

G=2023: Mg= 924.2: Tg=l5+-14 #F+(6,3) #FR+<1,1) 
^ S=83(86,79) Mp= 924 .4 ( 0.2) Tp^ 33 A2° 11/24 P«ALASHLIEA 

* genpept PR->gi I 7212607 | gb I AAF40470 . 1 1 AF181263_1 (AF181263) EH domain 

25 containing 2 [Homo sapiens] POS-=507 (SEQ ID NO; 73) 

|H G=2027: Mg= 924,5: Tg=13+-1 #G+(3,2) 

Iff S«83(85,79) Mp= 924>4{-0,l) Tp= 20 A2= 75/24 P-KLGPAPKTL 

jjl genpept PR=>gi i 4 08198 i gb | AAB27 691 . 1 i (S64671) DNA-binding 

30 protein/plasminogen activator inhibitor-1 regulator [human, HeLa S3, 
Peptide Partial, 176 aa] [Homo sapiens] POS-133 (SEQ ID NO: 74) 



O G=2029: Mg^ 924.5: Tg-43+-l #F+(1,1) #G+(17,8) 

111 S-93{91,99) Mp° 924. 6 ( 0.1) Tp<= 44 A2«>lk/27 P-KLLEPVLL 

f],j 35 genpept PR->gi 1 338447 |gb | AAA60583.il (M60854) RPS16 [Homo sapiens] 

POS=50 (SEQ ID NO:75) 



40 



G-2O50i Mg= 926.5: Tg=14+-3 #F+(8,2) #EST+(l f l) #FR+(3,2) #G+(6,3) 
S=90(96,79) Mp= 926. 4 (-0,1) Tp= 29 A2= 78/30 P=A,LSGHL£TV 
genpept PR=>gi 1 12314197 | emb | CAB99342 . 1 1 (AL13900S) bA255A11.8 (novel 
protein similar to annexin A2 (ANXA2) (lipocortin II, calpactin I 
heavy chain f chromobindin 8, PAP-IV) ) [Homo sapiens] POS=90 (SEQ ID 
N0:76) 

45 G=2068; Mg° 929.5: Tg=43+-1 #E+(2,2) #F+(20,7) #FR+(1,1) 
#G+06,H) #K+<24,6) #L+(3,2) 

S=92(90,99) Mp= 929-5 ( 0.0) Tp= 31 A2=173/25 P-SLLDKIIGA 
genpept PR«>gi 1 11034809 I gb| AAG27093.1 1 AF312393_1 (AF312393) 

leucine-2ipper protein FKSG13 [Homo sapiens] PGS=56 (SEQ ID NO:77) 



50 
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G-2071: Mg= 930.3: Tg=35+-1 #F+(4,3) 4G+(l3,7) #K+(3,2) 
S=91(97,77) Mp» 930.4 ( 0.1) Tp= 33 A2^257/33 P=GLLGAGGTVSV 
genpept PR=>gi 1 114 93522 I gb I AAG35534 . 1 1 AF130117_68 (AF130109) PR01512 
[Homo sapiens] PQS=*17 (SEQ ID NO: 78) 

G=2072: Mg= 930.4: Tg=53+-0 #D+ (8, 4 ) #£+(21,8) #F+(10 f 5) 
#EST+(7,2) #FR+(6,3) #G+(11,7) #K+(7,4) 
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S=78<83,69) Mp- 930. 6( 0.2) Tp= 53 A2=6Q8/32 P=GLVPFLVSV 
qenpept PR->gi 1 13543657 1 gb | AAH05978 . 1 1 AAH05978 (BC005978) 

karyopherin alpha 2 (RAG cohort l, importin alpha 1) [Homo sapiens] 
PQS«377 (S£Q ID NO;79)ref 

G=2095: Mg= 932.5: Tg*4 6+-0 #F+(13,7) #G+(1,1) #K+(2,1) 
S=72(74,69) Mp* 932*51 0.0) Tp= 46 A2^ 54/27 P=ILGLGYPSL 
qenpept PR->gi I 7339520 I etnb I CAB82850. 1 1 (AJ250717) procathepsin E 
[Homo sapiens] POS=l84 (SEQ ID MO: 80) 



G-2126; Mg- 936.3: Tg=39+-1 #E+(1,1) #F+(7,5) #G+[1,1) 
S=81 (86, 72) Mp- 936.4 ( 0.1) Tp= 40 A2=213/26 P=ALLAGSEYL 
qenpept PR^>gill2653123lgb|AAH00328.11AAH00328 (BC000328) eukaryotic 
translation initiation factor 3, subunit 7 (zeta, 66/67kD) [Homo 
15 sapiens] P05=439 (SEQ ID NO: 81) 

G=2146: Mg= 938.3: Tg=38+-0 #FE+(2 ( 1) 

S=77(72 r 90) Mp= 938.5 ( 0,2) Tp= 34 A2=656/33 F^SLAELVHAV 
qenpept PR*>gi I 4092863 | gb | AAD04812 . 1 1 (AF033122) non-p53 regulated 
20 PA26-T1 nuclear protein (Homo sapiens] POS=254 (SEQ ID NO: 82) 

G=2l"60i Mg* 940.4.* Tg*58+-1 #E+{6,4) #F+(8,3) #G+(1,1) #K+(3,2) 

^ So82(80, 86) Mp= 940. 6( 0.2) Tp= 53 A2- 8/15 P-MQPILLLL 

*f qenpept PR=>gi 1 181159 1 gb I AAB59528 . 1 1 (J03072) serine protease 6 

CO 25 [Homo sapiens] POS=l (SEQ ID NO: 83) 

|J1 

Iff G=2176: Mg= 942,1: Tg=48+-0 #G+(5,3) 

pi S=74 (74,77) Mp= 942. 5 ( 0,4) Tp= 48 A2- 2/12 P=GLFAPQFY 

qenpept PR=>gi 1 2062371 1 gb | AAB65850 . 1 1 (U7Q730) SnoN2 [Homo sapiens] 

J£" 30 POS-274 (SEQ ID NO: 84) 

L G=2208; Mg= 944.5: Tg=59+-2 #S+(2,1) #L+(4,2) 

O S=86(94,69) Mp° 944. 5 ( 0.0) Tp= 48 A2=577/25 P=ALWGQG1%V 

HI genpept PR=>gi 1773628 1 gb | AAA88873 . 1 1 (U21267) immunoglobulin rou 

fy 35 heavy chain [Homo sapiens) POS«103 (SEQ ID NO:85) 

kD 

p G-2213: Mg° 945.4: Tg=37+-1 #S+(12,1) #£+(60,10) #F+(6,5) 

#E$T+(5,2] #FR+(ll f 3) #K+(1 # 1) 

S=87(97 f 66) Mp= 945. 5 ( 0.1) Tp= 34 A2*592/34 P=SLLGGDVVSV 
40 qenpept PR=>gi 1 5231131 1 gb | AAD41085 . 1 | AF153603_1 (AF153603) TSC-22 
related protein [Homo sapiens] POS=27 {SEQ ID NO: 7) 

G=2231: Mg= 947,3: Tg=41+-1 #E+(4,2) #FR+(1,1) #G+{11,6) 
5=85(88,81) Mp= 947. 3( 0.0) Tp= 20 A2^0.0/18 P=DTETAWNV 
45 qenpept PR->gi | 4883681 | gb I AAD315 96 . 1 | AF057352_1 (AF057352) 

hepatocellular carcinoma autoantigen [Homo sapiens] PQS«117 (SEQ ID 
NO:86) 

G«2233: Mg= 947.4: Tg=34+-1 #S+(9,1) #EST+(3,2) #FR+{4,2) 
SO 3-90(90, 90) Mp= 947.4 { 0.0) Tp= 30 A2= 70/23 P=NLTISDVSV 
genpept FR=>gi I 541680 i emb ICAA56734 . 1 1 (X80761) M0C1 [Homo sapiens] 
POS=133 (SEQ ID KO:8) ref 



55 



G«2241: Mg= 948.3: Tg=58+-1 #E+(6,3) #F+(4,3) #EST+(1,1) #FR+(3,2) 
#K+<5,3) 
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5-75(74,79) Mp= 948, 5( 0.2) Tp= 62 A2=203/21 P=ALLPIFFGA 
genpept PR=>gi 1 13185197 I emb | CAC33282 . 1 1 (AX083359) unnamed protean 
product [Homo sapiens) POS-43 (SEQ ID 110:87) 

5 G»227Q: Mg= 951.6: Tg-40+-l #D+(11 # 4) #£+(49,10) #F+(2 f l) 
#EST+(3,U #FR+<7,3) #G+<11,6) ttK+(4,2) #L+(4,2) 

S=88<93,79) Mp° 951«3(-0.1) Tp= 33 A2=191/22 P^AMVIFKSGV 
genpept PR=>gi I 3929529 I gb | AAC82612 . 1 1 (AF0346U) intrinsic 

factor-Bl2 receptor precursor; cubilin [Homo sapiens] POS°3371 (SEQ 
10 ID WO; 88) 

G-2299: Mq= 954.4: Tg=50+-0 #D+(1,1> #E+(31,9) #F+<15,7} 
#EST+(5,2) #FR+(8,3) #G+(28,11) #K+(16,6) #L+(4,2) 

S=81(89,63) Mp=^ 954. 5 { 0,1} Tp= 49 A2=l82/34 P=$LLPAIVEL 
15 genpept PR=>gi I 3603418 ! gb j AAC63525 . 1 1 (AF083439) protein phosphatase 
2A regulatory subunit A r beta isoform [Homo sapiens] POS«415 (SEQ 
ID NO:89)ref 

G=2329: Mg* 956.6: Tg=33+-2 #E$T+(3,2) #FR+(5,3) 
20 5=81(83,79) Mp= 956, 5 (-0.1) Tp= 32 A2-736/32 P-YLGPHTASV 
genpept PR=>gi 1 12052942 I emb I CAB6664 6 . 1 I (AL136711) hypothetical 
protein [Homo sapiens] POS-137 (SEQ ID NO: 89) 

G=2344: Mg= 958.3: Tg=33+-2 #S+(6,2) #D+(8,2) #E+(100,8) 
t0 25 #G+(34,U) #K+(4,2) 

01 3*96(96,99) Mp- 958. 3< 0.0) Tp* 38 A2= 41/23 P=SLWGQPAEA 

||| genpept PR=*>gi I 463430 I gb I AAC27816 . 1 [ (U04520) type IV collagen alpha 

ifj 5 chain (Homo sapiens] POS^18 {SEQ ID MO: 9) 



30 G°2350: Mg= 959,3: Tg=46+-1 #G+(14,7) #K+(8,5) 

S=79(90,54) Mp= 959. 6( 0.3) Tp= 45 A2= 16/27 P=SLFPGQWI 
qenpept PR=>gi | 12654 999 I gb | AAH01347 . 1 1 AAH01347 (BC001347) polymerase 



|J (DMA-directed), alpha (70kD) [Homo sapiens} POS=295 (SEQ ID NO:90) 



35 G-2355: Mg° 959.5: Tg-38+-0 #F+(3,2) 

S=82(80,89) Mp= 959.5 ( 0.0) Tp= 38 A2°324/29 F=SLLEKSLGL 
genpept PR«>gi 1 13529002 I gb| AAH05291. 1 | AAH05291 (BC005291) eukaryotic 
translation elongation factor 1 epsilon 1 [Homo sapiens] POS=8 (SEQ 
ID NO: 91) 

40 

G*=2356: Mg« 959. S; Tg=30+-1 #D+(3,1) #£+(29,9) #F+(2,2) #EST+<7,2) 
#FR+(9,3) #G+(49,10) #K+(6,2) 

S=85(84,90) Mp<= 959.5 ( 0.0) Tp= 27 A2=485/24 PoILTDITKGV 
genpept PR<=>gi 1 181969! gb I AAA50388 . 1 f (M1999?) elongation factor 2 
45 [Homo sapiens] POS<L61 (SEQ ID NO: 92) 

G=2372: Mg= 960,5: Tg=35+-1 #D+(2,1) #G+(5,5) 

S=78(73,90) Mp= 960.5 ( 0.0) Tp= 34 A2= 79/24 P°GLFQGKTPL 

genpept PR=>gi | 4589929 1 dbj t BM7 6931 . 1 1 (AB024704) fls353 [Homo 
50 sapiens] POS=53 (SEQ ID NO: 93) 

G=2382: Mg= 962.3: Tg«47+-0 #G+f$,4> 

S=80(82,77) Mp= 962. 5{ 0-2) Tp= 11 A2=0.0/6 P°ESQLKKMV 
genpept PR=>gi| 12803337 | gb I AAH02487 . 1 IAAH02487 (BC0024 87) tumor 
55 susceptibility gene 101 [Homo sapiens] POS=5 (SEQ ID NO: 94) 



G=2434: Mg= 967.3: Tg=54+-0 #G+(10,7) 
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5=83(85,79) Mp= 967. 5( 0.2) Tp= 61 A2=139/19 P=FLYPFPLA 

2enpe£t PR=>gi | 436224 | dbj j BAA05062 . 1 1 (D26067 ) KIAA0033 [Homo 
sapiens] POS<=185 (SEQ ID NO: 95) 

5 G-2446: Mg= 968.4: Tg=20+-2 #F+(4,2) #G+<i9,6) 

3=86(89,79) Mp= 968.4 ( 0.0) Tp= 29 A2= 78/29 P-ALTGHLEEV 
<?enpept PR°>gi I 34388 lembjCAA29338.il (X05908) lipocortin (AA 1-346) 
[Homo sapiens] POS=99 (SEQ ID NO; 96) 

10 G=2447: Mg-> 968.4: Tg-42+-l #D+{1,1) #E+(1,1) #F+U1,7) #FR+{1,1) 
#G+(27 # 11) #K+(6,3) #L+{3,2) 

S=87(87,90) Mp= 968.4 ( 0.0) Tp= 36 A2=>lk/33 P=SLLDPVPEV 
genpept PR=>gi| 1504020 I dbj |BAA13209.1| (D86973) similar to Yeast 
translation activator GCN1 (P1:A48126) [Homo sapiens] POS=1406 (SEQ 
15 ID NO:97) 



G=2464: Mg= 969.5: Tg=47+-0 #E+(2,2) #F+(7,5) #G+(2S,11) 
5=83 (85,81) Mp^ 969. 5{ 0.0) Tp= 48 A2^ 1/19 P-MAPQALLLL 
genpept FR->gi 1 1780998 I embj CAA71531 . 1 1 (Y10520) HLA-C alpha chain 
20 (Cw*1701) [Homo sapiens] POS<=4 (SEQ ID NO: 98) 

G=2489: Mg= 971.5: Tg=42+-0 #D+(9,4) #F+ (10, 6) 

S=88(9l,81) Mp- 971.4 (-0.1) Tp= 42 A2= 1/23 P-FSNGYLASL 
genpept PR=>gi | 12 655065 | gb I AAH01382 . 1 | AAH01382 (BC001382) solute 
25 carrier family 29 (nucleoside transporters), member 1 [Homo sapiens] 
POS=405 (SEQ ID NQ:99) 

G=2495: Mg= 972.4: Tg=52+-1 #D+(25,5) #E+(19,9) #F+(24,7) 
#EST+(5,2) #FR+(8,3) #G+(43,11) #K+{32,6) #L+(4,2) 
30 S=91(96,8l) Mp= 972. 5 ( 0.1) Tp° 41 A2=656/30 P==TLIEDILGV 
genpept PR->gi| 11121497 | embj CAC14 94 6.1 | (AL132825) dJ756N5.2 (novel 
K protein (DKF2p727M23l) similar to Trp4-asscciated protein TAP1 

f«l (ABCB2) ) [Homo sapiens] POS=209 (SEQ ID NO:X00) 

Ifl 

~: 35 G=2514: Mg= 973.4: Tg=33+-1 #F+{5,3) »G+(10,6) #K+(4 ( 2) #L+<1,1) 

^ $=82(84,79) Mp= 973.4 ( 0.0) Tp= 31 A2=0.0/17 P=IAEAVRTTL 

tf3 genpept PR^>gi 1 2559010 1 gb [AAC96011 . 1 1 (AF026292) chaperonin 

P containing t-complex polypeptide 1, eta subunit; CCT-eta [Homo 

sapiens] POS^32 (SEQ ID NO: 101) 



in 



40 



45 



G=2515; Mg~ 973,5: Tg=*34+-1 #EST+(5,2) 

S=80{80,81) Mp= 973.4 (-0.1) Tp= 31 A2=307/27 P^KLSELEAAL 
genpept PR=>gi 1 12314174 | emb | CAC08001. 1 1 (AL137067) bA13B9.3 (novel 
protein similar to KRT8 ) [Homo sapiens] POS-368 (SEQ ID WO: 102) 



G°2522: Mg= 974.3: Tg«30+-2 #S+(7,1) #£+(3,3) #EST+(8,2) 
#FR+{11,3) 

3=89(90,89) Mp= 974. 5 ( 0.2) Tp= 25 A2= 6/21 P=SLSVKL£QA 

genpept PR=>gi I 37258 I emb I CAA44819 . 1 1 (X63105) Xpr [Homo sapiens] 
50 POS=453 (SEQ ID NO: 104) 

G-2527: Mg= 974.3: Tg«50+-0 #Dt(l,l> #E+(15,7) #F+(9,4) #G+(22,10) 
#K+(10,5) 

S°90(98,72) Mp<= 974.5 ( 0.2) Tp= 50 A2=4 13/26 P=MLLAALMIV 
55 genpept PR=>gi | 5802822 I gb | AAD51798 . 1 1 AF164 6l4_2 (AF164614) envelope 
protein [Homo sapiens] POS^76 (SEQ ID NO: 105) 
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G=2537: Mg- 974.5: Tg=53+-0 #F+(4,3) #G+{13,9) #K+(2,2) 
S=81(83,79) Mp- 974. 5 ( 0,0) Tp^ 56 h2^ 60/24 P=AILPTSIFL 
genpapt PR->gi 123234101 gb|AAB66581. 1| (AF015913) SkblHs [Homo 
sapiens] PO$<=229 (SEQ ID NO; 106) 

5 

G=2546; Mg- 975.4; Tg-38+-l #E+(3,2) #F+(7,4) #G+<19,10) #K+(1,1) 
S=82(91,63> Mp= 975. 4 ( 0.0} Tp- 32 A2 = 8/27 P°AALPNVYEV 
qenpept PR-=>gi 1 12652781 1 gb |AAH00142 . 1 1 AAH00142 (BC000142) 

minichromosome maintenance deficient (S. cerevisiae) 5 (cell 
10 division cycle 46) [Homo sapiens] POS°326 (SEQ ID NO: 107) 

G=2567: Mg= 977.5; Tg<=>22+-3 #G+(9 r 5) 

$"84(82,90) Mp= 977. 4 (-0.1) Tp= 24 A2=18 6/24 P=RMLPHAPGV 
qenpept PR=>gi 1 1667394 I gb | AAC50814 ( 1 1 (U31814 ) transcriptional 
15 regulator homolog RPD3 [Homo sapiens] POS=372 (SEQ ID NO: 108} 

G=2610: Mg* 981.7: Tg=36+-0 #S #F+(3,2) 

S=79(80,79) Mp= 9B1.6(-0.1) Tp= 38 A2= 49/32 P=SLIGHLQTL 
qenpept PR=>gi | 642013 | gb| AAB06261 . 1 | (U16996) protein tyrosine 
20 posphatase [Homo sapien3] POS-337 (SEQ ID NO: 10) 

G=2636: Mg= 984.5: Tg-61+-1 #D+(S,4) #£+(9,5) #F+(12,6) #FR+(5,2) 
O #G+{2,1) #K+<12,5) #L+(1,1) 

S=85(91,72) Mp- 984, 7( 0.2) Tp= 61 A2*= 11/21 P=LMVLVALIL 
|J 25 qenpept PR=>gi 1 12654925 1 gb 1AAH01309 . 1 1 AAH01309 (BC001309) Unknown 

p (protein for MGC:5508) [Homo sapiens] POS=19 (SEQ ID NO: 109) 

^ G=2641: Mg= 984.7: Tg=36+-0 #EST+(1,1) #FR+(2 f l) 

U1 S=78(77,81> Mp° 983.5 (-1.2) Tp= 35 A2-140/28 P=KILPTL£AV 

jp 30 genpept PR=>gi 1 12653227 I gb | AAH00382 , 1 1 AAH0Q382 (BC000382) 

interleukin enhancer binding factor 2, 45kD (Homo sapiens] POS=127 
(SEQ ID NO:110) 



G=2649: Mg= 965.5: Tg^40+-1 #E+(5,3) #FR+(3,2) #G+(4,3) 
jp. 35 S°84(93,63) Mp= 985. 6f 0.1) Tp= 38 A2°>lk/33 P=ALLDRIVSV 

^1 qenpept PR->gi 1 1504030 | dbj I BAA13214 . 1 1 (D36978) similar to a 

yy c.elegans protein encoded in cosmid K12D12 (249069) [Homo sapiens] 

O P0S=1499 (SEQ ID NO:lll) 



40 G-2661: Mg- 986.6: Tg=35+-1 #E+<3,2) #F+{3,2) #EST+(3,2) #FR+(1,1) 
#G+(2,2) 

S=84(82,89) Mp= 986. 7 ( 0.1) Tp= 35 A2=>160/26 P=TLVYHWGV 
qenpept PR=>gi I 3540219 1 dbj I BAA32662 > 1 1 (D87686) KIAA0017 protein 
[Homo sapiens] POS-165 (SEQ ID NO: 112) 



45 



50 



55 



G=2666: Mg= 987.4: Tg-32+-2 #D+(1,U #E+U,1) #F+(5,2) #G+(12,7) 
S=77(87,54) Mp= 987. 5 ( 0.1) Tp= 33 A2=131/26 P=>YLPPATQW 
qenpept PR=>gi i 1332514 6 I gb S AAH04366. 1 1 AAH04 386 (BC004386) KIAA0111 
gene product [Homo sapiens] POS=207 (SEQ ID NO: 113) 

G=2668: Mg= 987,4: Tg=14+-13 #F+(4,3) 

5=77(76,81) Mp= 987.3 (-0.1) Tp- 26 A2-0.0/15 P-PMEALAEQV 
qenpept PR=>gi | 3882297 | dbj | BAA34 508 . 1 | (AB018331) KIAA0788 protein 
[Homo sapiens] POS=569 (SEQ ID NO: 114) 

G=2671: Mg= 987.6: Tg=29+-1 #F+(4,3) #EST+(1,1) #FR+(3,2) 
#G+{11,5) 



m „ M TO: 972 9 7706333 
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S=7fl{83,54) Mp= 987, 5 (-0.1) Tp- 33 A2-656/30 P-RLSEAIVTV 
g« n pept PR->gi| 7106848 I gb| AAF36149 . 1 1 AF151063J. (AF151063) HSPC229 
{Homo sapiens] POS=137 {SEQ ID NO: 115) 

5 6=2677: Mg= 988 t 3: Tg=13+-4 #E+(2,1) #F+{7,1) #EST+(4,1) #6+ (21, 6) 
S=88<99,63) Mp*= 988.4 ( 0,1) Tp= 20 A2<* 28/27 P^SLDQPTQTV 
? en P®F£ PR ss >gi| 1718197 !gb|AAD034 62. 1| (U46025) translation intiaticn 
factor eIF-3 pllO subunit [Homo sapiens] POS=834 (SEQ ID NO: 116) 

10 G-2692: Mg- 989.4: Tg-41+-1 #S+(8,2) #D+(13,5) #E+(12,6) #F+(ll f 6) 
#EST+(4 / 2) #FR+(6,3) #G+ (15, 8) #K+(13,6) 

S=79(83,72) Mp*= 989. 5( 0.1) Tp=> 39 A2=257/30 P=SLFPGKLEV 
genpept PR->gi ! 440177 | gb (AAC03568 . 1 | {0011841 fiightless-I homclog 
[Homo sapiens] POS-1009 (SEQ ID NO: 12) 

15 

G=2693: Mg« 989.5: Tg=*31+-2 #S+(X5,2) #E+(7,3) #F+(13,5) 
#EST+(3,1) #G+(12,7) ttK+(6,4) 

S=^83(84, 8l) Mp« 989. 5{ 0.0) Tp= 35 A2= 88/29 P=SLSEKTVLL 
genpept PR=>gi|180151igbiAAA88793.1| (M84349) CD59 protein [Homo 
20 sapiens] POS-106 (SEQ ID NO: 11) 

G=2729: Mg= 993.5: Tg=18+-4 #F+<2,1) #EST+(4,2) #FR+(8,3) #G+(9,4) 
#K+(1,1) 

S=92(97,81) Mp= 993. 6 ( 0.1) Tp= 22 A2=243/23 P=KLHGWINV 
25 genpept PR=>gi | 12653083 | gb I AAH00307 . 1 | AAH00307 (BC000307) RNA 
binding motif protein 4 [Homo sapiens] PQS^59 (SEQ ID NO: 117) 

G=2769: Mg= 999.5: Tg=35+-1 #H+(5,3) #I+(8,4) #J+(5,4) 
S=82(83,B1) Mpo 999. 5 { 0.0) Tp= 39 A2= 5/18 P-LVMAPRTVL 
30 genpept PR=>gi I 9738918 I gb j AAF97847 . 1 1 (AF129293) MHC class I antigen 
(Homo sapiens] POS=2 (SEQ ID NO: 118) 

G-2773: Mg= 999.6: Tg=45+-1 #D+(2,1) #£+(15,6) #F+(12,7) 
#EST+(3,1) #FR+(8,3) #G+(15,8) #K+(11,5) #L+(1,1) 
35 S=80(86,69) Mp= 999. 6( 0.0) Tp= 42 A2« 22/31 F=SIIGRU.EV 
genpept PR=>gi i 190516 | gb I AAA36508 . 1 | (M63960) protein phosphatase-l 
[Homo sapiens] POS=ll (SEQ ID NO: 119) 

G=>2785: Mg*1000.5: Tg=33+-1 #G+(14,6) #K+(2,2) 
40 S=77(77,79) Mp=l000.6{ 0.1) Tp- 36 A2= 2/16 P=MAVALQLRV 
genpept PR»>gi 1 11544742 I emb I CAC17582 , 1 1 (AL121997) dJ1043F6.1.1 
(Chediak-Higashi syndrome 1 (isoform 1)) [Homo sapiens] POS=2544 
(SEQ ID NO: 120) 

45 G=2789: Mg«=.1000.6: Tg=^26+-2 #F+{3,2) #EST+(2,2) #FR+(2,1) 
#G+(13,6) #K+(1,1) 

S^90(9Q,90) Mp-1000,4 (-0.2) Tp- 27 A2*656/30 P=GLNEEIARV 
genpept PR=>gi I 2501873 | gb IAABB0726 . 1 1 (AF017790) 

retinoblastoma-associated protein EEC [Homo sapiens] POS°330 (SEQ 
50 ID NO:121) 

G-2791; Mg^l001.3: Tg=40+-1 #F+(10,5) #G+(16,9) #K+(4 f 3) 
S=78(81,72) Mp-100l.6( 0.3) Tp= 19 A2-0.9/23 P=IMKVAQAKL 
genpept PR=>gi I 6941888 i gb | AAF32263 . 1 [ AF170562 J. (AF170562) 

55 ubiquitin-specific processing protease [Homo sapiens] POS«=875 (SEQ 
ID NO: 122) 
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G=2822; Mg«1004.2: Tg=27+-1 #G+(6,5] 

S=90(91,90) Mp=l004.4( 0.2) Tp= 30 A2= 88/25 P-TLSEVTNQL 
genpept PR=>gi | 12053045 | emb [CAB66698 . 1 | (AL1367 64 ) hypothetical 
protein [Homo sapiens] POS=484 (SEQ ID NO: 123) 

G=2829: Mg=1004.5: Tg=38+-1 #F+(3,2) #EST+(1 # 1) #FR+(2,1) #G+{9,6) 
S=91(92,90) Mp=1004 t 6( 0.1) Tp<= 37 A2-324/29 P=>ALFEGKVQL 
genpept PR=>gi | 10439712 | dbj | BAB15S50 . 1 | (AK026730) unnamed protein 
product (Homo sapiens] POS=442 (SEQ ID NO: 124) 

G=2833: Mg=1004,6: Tg=29+~0 #EST+(3,1) 

S=*87(87, 89) Mp-1004.6( 0.0) Tp= 31 A2 = 32/28 P-GLKGRVFEV 
genpept PR=>gi | 854 179 1 emb I CAA60827 . 1 1 (X87373) ribosomal protein S3a 
[Homo sapiens] POS=61 (SEQ ID NO: 125) 

G=2835; Mg=1005.2: Tg°48+-0 #G+{3,3) 

S=84(S3,89) Mp^l005.5( 0.3) Tp«= 42 A2= 35/25 P-NIFPYPVGV 
genpept PR->gi 1 2822460 I gb| AAC39565 . 1 1 (AF030234) splicing factor 
Sipl [Homo sapiens] POS°912 (SEQ ID NO: 126) 



G=2872: Mg-1009.6; Tg=47+-1 #E+£2,1) #EST+(5,2) #FR+(6,3) 

#K+(15,6) 

P S=87(96,66) Mp=1009,7( 0.1) Tp= 52 A2 = 3/18 P^LVSIWAVPL 

iO genpept PR=>gi | 702313 6 | dbj I BAA91851 , 1 | (AK001708) unnamed protein 

!B 25 product [Homo sapiens] POS=23 (SEQ ID NO; 127) 

m 

G-2881: Mg=1010,S: Tg=28+-1 #G+(8,5) 
^ S=84 (85,82) Mp=10l0.5( 0.0) Tp- 20 A2=370/30 P=NMYGKWTV 

^ genpept PR=>gi ! 1845267 | gb | AAC51102 . 1 | (CJ56402) SDPT5H [Homo sapiens] 

T 30 POS=562 (SEQ ID NO: 128) 



s G=2891: Mg-1011.5: Tg=43+-1 #E+(9,5) #F+(1,1) #EST+{3,2) #FR+(6,3) 

Q #G+(13,7) #K+(5,3) #L+(3,2) 

Ifl S=79<78, 82) Mp=>10ll.5( 0.0) Tp- 45 A2=>lk/3l P=LLLDVPTAAV 

35 genpept PR°>gi I 6165618 I gbl AAF04618 . 1 1 AF097362_1 (AF097362) 

l Z gamma-interf eron inducible lysosomal thiol reductase [Homo sapiens] 
POS°26 (SEQ ID NO: 2) ref 

!□ 



G=2918: Mg=1014.4: Tg=48+-0 #D+(1,1) #£+(16,8) #F+(11,7) 

40 #EST+(2,1) #FR+(3,2) #6+119,10) 

S=88<97,68) Mp=1014.6( 0.2) Tp= 4 6 A2=160/32 P-SLINVGLISV 
genpept PR=>gi| 12 6534 13 I gb | AAH0047 6 . 1 1 AAH0047 6 (BC000476) acidic 
protein rich in leucines [Homo sapiens] POS=48 (SEQ ID NO: 129) 

45 G°2928: Mg=1015.4: Tg=56+-0 #E+(26,8) #EST+(2,1) #FR+(5,3) 

S=92(97,81) Mp=1015.5( 0,1) Tp^ 61 A2-666/30 P=ALLGTLWEI 
genpept PR=>gi I 2224595 I dbj | BAA20785. 1 1 (AB002325) KIAA0327 protein 
[Homo sapiens] POS-18 {SEQ ID NO: 130) 

50 G=2929: Mg=10l5.4: Tg°41+-1 #E+(5,3) #EST+(4,2) #FR+<2,2) 

#G+(12,7) #K+{5,3) 

3=81(86,72) Mp=1015.5( 0.1) Tp= 39 A2= 13/16 P=FQDPVPLTV 
genpept PR=*>gi I 4325107 I gb I AAD172S8 , 1 1 (AF11904 2 ) transcriptional 
intermediary factor 1 alpha; TIFlalpha [Homo sapiens] POS=890 (SEQ 
55 ID NO:131) 
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G=2947: Mg=1016.4: Tg=45+-1 #E+(3,3) #F+(8,5) #EST+(2 / 2) #FR+ (3,1) 
#G+(18,10) #K+(7,4) ' 
5=82(95,54) Mp-1016.6( 0.2) Tp= 39 A2o5i 2 /28 P-GLYPNLIQV 
9^*^ ?R=>gi|4240269|dbj|BAA749l3.1| (ABQ20697) KIAA0890 protein 
[Homo sapiens] P0S=1Q22 (SEQ ID NO; 132) 

G=*2965: Mg=1018.4: Tg ra 23+-4 #D+(3,1) #E+(2,2) #F+(2,2) #G+(25,8) 
3=94(96, 90) Mp=1018.6( 0.2) Tp- 19 A2= 79/26 P=VMDSKIVQV 
flSPPapt PR=>gi | 913393 | gb 1MC60648 . 1 1 (S75295) nucleoprotein 
interactor 1, NPI~1=SRP1 homolog [human, cervical carcinoma HeLa 
cells, Peptide, 536 aa] [Homo sapiens] PQS=434 (SEQ ID NO: 133) 



G^2976: Mg=10l9.6: Tg=46+-0 #D+(5,2) #£+(6,1) #F+(2,2) #EST+(1,1) 
#FR+(2,1) #G+(4,3) 

15 S*83 (81,90) Mp<=l0l9.6( 0.0) Tp= 40 A2=745/32 P=ALLDKLYAL 
qenpept PE^>gi I 7023341 1 dbj |BAA9192 9.1 1 (AK001830) unnamed protein 
product (Homo sapiens] POS=78 (SEQ ID NO: 134) 

0^=2985: Mg=l02O.5: Tg=45+-0 #D+(5,3) #E+(3,2) #F+(4,3) #FR+U,1) 
20 #G+(7,5) 

$^99(99,99) Mp^lQ20.5( 0.0) Tp= 40 A2*296/27 F=NLASFIEQV 
qenpept PR=>gi | 348907 | gb I AAA35 672 * 1 1 (L15428) 4a-carbinolamine 
W dehydratase [Homo sapiens] POS=19 (SEQ ID NO: 135) 

yQ 

gl 25 G=2998: Mg=1022.4; Tg=44+-0 #G+(5,3) 

fjl S=76(70,90) Mp«1022.4( 0.0) Tp= 43 A2-0.7/12 P-TLWVDPYE 

iff qenpept PR«>gi 1 1703501 1 gb I AAB37580 . 1 1 (U72649) BTG2 [Homo sapiens] 

POS=101 (SEQ ID NO: 136) 

J 30 (3=3002: Mg°1022.5: Tg=45+-1 #S+(2,1) #D+(3,2) #G+(7,4) 

IB S=82(83,8l) Mp=1022.5( 0.0) Tp= 42 A2=>l)c/25 P=KIADFGWSV 

s qenpep j: PR»>gi | 3127068 | gb | AAC77369. 1 | (AF059681) serine/threonine 

Q kinase 13 [Homo sapiens] POS=147 {SEQ ID NO: 137) 

n ] 35 G-3036: Mg=*1025.5: Tg=37+-1 #S #D+(1,1) #F+(4,2) #E5T+(1,1) 

~* #G+{5,3) 

S=90(91,90) Mp=1025.6( 0.1) Tp= 36 A2= 89/28 P-SLLSHVEQL 
qenpept PR=>gi | 53054291 gb!AAD41647 . 1 |AF072933_1 (AF072933) Mad2-like 
N ! protein [Homo sapiens] POS°114 (SEQ ID NO: 138) 



40 



45 



G-3041: Mg°1026.3: Tg=45+-0 #D+(7,3) #FR+(1,1) #G+(4,3) 
S=84{90,72) Mp;=1025.6(-0,7) Tp= 38 A2->lk/30 P=GLADKVYFL 
qenpept PR«>gi 1 1228049 I dbj I BAA11423 . 1 j (D78586) multifunctional 
protein CAD [Homo sapiens] POS=445 (SEQ ID NO: 139) 



6*3061: Mg=1028.5: Tg=35+-1 ftS+{6,2) #D+(3,1) #E+(20,7) #F+(8,5) 
#E$T+(5,2) #FR+(5,2) #G+(11,7) 

3-88(92,81) Mp=1028.5( 0,0) Tp= 32 A2= 88/28 P=GLIEKNIEL 
genpept PR=>gi 1 1632819 I emb I CAA45219 . 1 1 (X63692) DNA 

50 (cytosine-5-)-methyltrans£ erase [Homo sapiens] POS*425 (SEQ ID 
NO: 13) 

G=3073: Mg^l029.5: Tg=51+-0 #D+(1,1) #FR+{2,1) #G+(5,4) #K+(2,1) 
5=81(78,90) Mp=102$.6l 0,1) Tp= 35 A2=>lk/31 P=SLLDIIEKV 
55 qenpept PR=>gi 1 1063586 1 gb IAAB415 64 . 1 1 (L48546) tuberin [Homo 
sapiens] POS-526 (SEQ ID NO: 140) 
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G=3092; Mg<031.4; Tg»61+-1 #S+(8,2j #D+(29,4) #E+(10,6) #F+(3,1) 
#EST+(2,1) «FR+(5 f 2J #H+U,U #K+(1,1) 

S<-84 (82, 90> Mp«1031.6{ 0.2) Tp- 64 A2-86S/30 P=GLYPGLIWL 
■ggPPg pt PR=>gi|2599385|gbIAAB841ll.li (AF027292) interferon 
5 regulatory factor 6 [Homo sapiens] PQS=2l [SEQ ID NO: 14) 

G«3118: Mg*1034.4: Tg=60+-1 #D+(16,5) #E+(45,9) #F+{14,7) 
#EST+(2,1) #FR+(10,3) #G+{11,7) #K+(7,4) 

S=81(85,72) Mp=1034.6( 0.2) Tp*= 66 A2 = 32/21 P-FVfPGELLL 
10 qenpept PR*>gi|12652633|gb|AAH00062.1lAAH00062 (BCD00062) solute 
carrier family 1 {neutral amino acid transporter), member 5 (Homo 
sapiens] POS^89 (SEQ ID NO: 141 J 

G=3127: Mg=1036.3: Tg^36+~0 #F+<2,1) 
15 S='78{77 l 81) Mp=1036.6( 0.3) Tp^ 35 A2=656/30 P=ALNELLQRV 
genpept PR=>gi 1 6682361 ! gb ! AAF23322 , 1 1 AF177198_1 (AF177198 ) talin 
[Homo sapiens] POS=777 (SEQ ID NO:142)ref 

(5=3128; Mg=lG36,3: Tg-36+-l #6+112,7) 
20 5=83(84,81) Mp=1036.5( 0.2) Tp= 29 A2^913/27 P-NLYEGQITV 
Qenpept PR=»>gi 1 1699038 1 gb | AAC50967 . 1 [ (078735] ABC 3 [Homo sapiens] 
POS=555 (SEQ ID NO: 143) 



G«3142: Mg^l037.5: Tg=43+-1 #EST+(2,2) #FR+(5,2) #G+U,1) 
25 #I+(11,5) #J+<5,3) 

5=86(89,79) Mp=*1037,5( 0.0) Tp- 41 A2<=0.1/15 P-FTKDFAPV1 
CP qenpept PROgi i 7022824 | dbj I BAA91736. 1 i (AK001518) unnamed protein 

!n product [Homo sapiens] POS=77 (SEQ ID NO: 1441 



30 G=3l44: Mg»1037,6: Tg°5l+-1 #D+(7,3) #E+(29, 6) #F+(U,7) 

S #EST+(2,1) #FR+{4,2) #G+(12,7) #K+[3,2) #L+(1,1> 

w $=87 (86,90) Mp-1037.7( 0.1) Tp= 53 A2=>lk/31 P=KLLEPVLLL 

* n genpept PR=>gi| 338447 |gbi AAA60583.1 1 (M60854) RPS16 [Homo sapiens] 

P0S=50 (SEQ ID NO:145)ref 



III ^ G=3154: Mg-1038.5; Tg=48+-1 #D+(32,4J #£+(48,9) #F+(7,5) 

CI #EST+(6,2) #FR+<9,3) #G + (24,10) #K+(9,6) 

3=81(82,81) Mp=1038.7( 0.2) Tp- 47 M ; 40B ^0 4n1 ™!"SS 



H o wpept ' PR=>gi|2832596|emb|CAB09792.1| (297056) dJ434Pl,3 (DEAD/H 

40 feS^Iu-Ala-Asp/His) box polypeptide 17 (72kD) ) [Home sapiens) 
POS-146 (SEQ ID NO: 15) ref 

G=3183- Ma^l041.4: Tg=52+-0 *FR+(2,1) #G+(1,1) 

Jvii.m MP=1041.6( 0.2) Tp= 52 A2=>l 5 c/23 P=GLFAPQFW 
45 qenpept PR=>gi| 2062371 1 gblAAB65B50. 11 (U70730) SnoN2 [Homo sapiens] 
POS=274 (SEQ ID NO: 146) 

S=87(90 81) Mp=1042.5( 0.1) Tp- 27 A2=805/27 P=LMVDHVTEV 
50 -npU PB^>gi.9»0612| tf b.M fi 02ll5 1IAT293025.1 ^9302 5 ) steroid 
^Sptor RNA activator isoform 2 [Homo sapiens] POS-163 (SEQ ID 
NO:147) 

G-3201: Mg=1043.5: Tg=58 + -1 #E+(2.2) #F+<6,4) #BST+<1.D frR+12,1) 
55 #G+(7,4) #K+(1,1) 
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S*85{88,8l) Mp=1043.7f 0.2) Tp= 62 A2=408/27 P^FLLPILSQI 
aen£ept PR»>gi 12580552 | gb | AAC51830.il (AF000963) dead box, X isoform 
[Homo sapiens] POS=234 (SEQ ID NO: 148) 

5 G=32l3: Mg=lQ45.5: Tg=58+-0 #E+(1,1) #F+(2,2) #FR+(1,1) 

S«84(90,72) Mp=1044.5(-1.0) Tp= 54 A2«0.3/18 P=*FLIPLNITN 
genpept PR=>gi 1 2224611 1 dbjlBAA20793.il (AB002333) KIAA0335 [Homo 
sapiens] POS=938 (SEQ ID NO: 149) 

10 G-3219: Mg=1046.6; Tg*4(H-l #D+<2,2) #E+(1,1) #F+(2,1> #EST+{4,1) 
#FR+(5,2) #G+(1,1) #K+(3,3) 

5=85(83,90) *4p*1046.7[ 0.1) Tp= 39 A2*243/30 P=NLLPKLHXV 
genpept PR->gi I 4 588524 | gb | AAD2 6136. 1 | AF109196_1 (AF109196) 

intracellular chloride channel p64Hl (Homo sapiens" POS= f 190 (SEQ ID 
15 NO:150) 

G=3227: Mg=1047.6: Tg=44+-0 |D+(1,1) fE+(15,8) #F+(2,2) #EST+<5,2) 
#FR+(7,3) #G+(12,6) 

S=79(82, 72) Mp=>1047.6( 0.0) Tp*= 50 A2=413/31 P=UDRFLATV 
20 genpept PR^>gi | 12653303 |gb (AAH00420. 1 (AAH00420 (SC000420) cyclin I 
(Homo sapiens] POS=72 (SEQ ID NO:151) 

O G=3240: Mg°1049,4: Tg*41+-1 #E+{3,2) #F+(2,2) #EST+(2,2) #FR+(3,3) 

•Si 5=77 (79, 74) Mp=1049.5( 0.1) Tp<= 36 A2=294/29 P=YLDPSVLSGV 

(fl 25 genpept PR=>gi J 505099 | dbj I BAA06683 . 1 1 (D31885) KIAA0069 [Homo 

P| sapiens) POS-84 (SEQ ID NO: 152) 



8f§ 



G~3242r Mg=1049,5: Tg*44+-0 #F+f9,6) 

S=*78 (85,63) Mp=1048.5(-1.0) Tp- 44 A2«378/27 P-LtYPTEITV 
# 30 qgng ept PR=>gi 1 220141 1 cibj IBAA0084 5 . 1 1 (D01038) VLA-3 alpha subunit 

Si [Homo sapiens] POS^798 (SEQ ID NO: 153) 

Q G=*3257; Mg*l051,4: Tg=65+-1 #D+(2,2) #E+{9,4) #F+(9,4) #£ST+<2,1) 

IH #FR+(6,3) 

5' 2 35 S-88(98,9Q) Mp-1051.6( 0.2) Tp= 63 A2^>lk/26 P=NLGDFLIFL 

^ genpept PR<=*>gi 1 1469175 I dbj I BAA09475 . 1 1 (D50916) The KIAA0126 gene is 

jj£ partially related to a yeast gene, [Homo sapiens! POS=638 (SEQ ID 

O NO: 154) 

40 G=3258: Mg-1051.4; Tg-54+-0 #D+(18,4) #E+(10,6) #F+{8,5) #G+U,D 
#K+(3,2) 

S=79(85 f 66) Mp=105l.5( 0.1) Tp= 56 A2=>lk/30 P-GLYEGLTWL 
genpept PR=>gi 1 178989 1 gb |AAA90928 . 1 i (M577C3) ADP-ribosylation 
factor [Homo sapiens] POS=161 (SEQ ID NO: 155) 

45 

G=3270: Mg=1054.3: Tg-51+-0 #D+{5,3) #£>(19,8) #F+<12,7) 

#EST+(2,1) #FR+(5,2) 

3=96(96,99) Mp<L054.5( 0,2) Tp^ 48 A.2=437/19 P=SLFDLNFQA 
qenpept PR~>gi ! 189292 I gb | AAB60701 « 1 1 (M81600) NAD (P) H : quinone 
50 oxireductase [Homo sapiens] POS=227 (SEQ ID NO:156) 

6=3271: Mg=1054.3: Tg=55+-1 #K+(S,2) 

S=80<77,90) Mp=1054.4( 0.1) Tp^= 43 A2=0.0/8 P=MFSLEDSII 

genpept PR=>gi ! 8 09029 1 emb! CAA57993. 1 1 (X82676) tyrosine phosphatase 
55 [Homo sapiens] POS=833 (SEQ ID NO: 157) 



G=3279: Mg-1055.4: Tg=37+-1 ttG+(6,4) 
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[Homo sapiens] p 0 S=6S (SEQ ID NO: 158) l«im«is) MDS018 

5 G=3297: Mg=1057.5: Tg=17t-2 #0+18,4) 

protein L13a [Homo sapiens] POS=137 {SEQ id NO: 159) 

10 g-3309: Mg-1059.5: Tg-34+-0 #F+(3,3) #EST+(3,1) #FR + (2 1) 

^82 (64,79) |^103 9 .6( 0.1) T p- 30 ^82/24 pUlIDHQTYL 
g e "PS Bt PR=>gi f 1477651 1 gb [ AAB05428 > 1 1 (U63610) plectin 
sapiens] POS=4188 (SEQ ID NO: 160) ^ fel °> piectin [Homo 

15 G-3325: Mg-1061.4: Tg°40+-1 #G+(6,3) 

8=85(87, 81) Mp-1061.7( 0.3) T p= 31 A2-523/26 P=AIQDKLFQV 

Sc^^ CBC006123, Similar to 

RIKEN CDNA 0710001P09 gene [Homo sapiens] PO S=96 (S£Q ID NO: 161) 

20 G=3329: Mg=.1062.4: Tg=29+-0 #H+(6,3) 

3-89(89,89) M P *1062.5( 0.1) T p= 27 A2-0.0/9 P=IVKWDRDM 
^ npep PR=> ^ ' 179316 | gb | AAA51811 . 1 1 (M17987) beta-2-microglobulin 
[Homo sapiens] PQS=112 (SEQ ID NO: 162) 



p 25 G=3331: Mg-1062.5: Tg=33+-1 #F + (9,6) #EST+<1,1) #G+[18,10> 

111 S=86(97,63) Mp=1062.6( 0.1) Tp= 30 A2- 6/20 P=RIIDWYNA 

( n aSSEejDt PR=>gi|36150|emb|CAA47670.1| (X67247) ribosomal protein S8 

% [Homo sapiens] POS=77 (SEQ ID NO: 163) 

® G=3342: Mg°1064.4; Tg=20+-4 #E+(l f l) #F+(6,4) #EST+<3,1) #G+(16,6) 

1 S=86{88,82) Mp-1064.6( 0.2) Tp= 19 A2=439/28 P=KIY£GQVEV 

rj flenpept PR=>gi | 550013 | gb I AAA85654.il (014 966) ribosomal protein L5 

|n [Homo sapiens) FOS*117 (5EQ ID NO: 164) 

iii 35 

fcfj G-3364: Wg=1066.3: Tg=50+-0 #E+(3,2) #G+{18,10) #K+(9,5) #L+<1,1) 

^ S=81(85,72) Mp=1066.6( 0.3) Tp- 50 A2=736/26 P-FLPSYIIDV 

J 8 ? genpcpt PR«>gi 1 1045574 I gb | AAC50293 . i| (U37012) cleavage and 

^ polyadenylation specificity factor [Homo sapiens] P0S=185 (SEQ ID 

40 NO: 165) 

0=3384: Mg=1068.4: Tg=29+-2 #S+(8,2) #D+(3,I) #£+(17,4) #F+(8,5) 
#EST+(5,2) #FR+{6,3) #G+(18,7) #K+(4,2) 

S«87(90,81) Mp=1068.6[ 0.2) Tp° 29 A2=482/23 P=ALSDHHIYL 
45 genpept PR=>gi I 28597 | emb | CAA28861 . 1 1 (X05236) aldolase A <AA 1-364) 
[Homo sapiens] POS-216 (SEQ ID NO:16) ref 

G-3385: Mg^lQ69.3: Tg=37+-1 #F+(2,2) #G+(7,5) 

S=84 (92, 66) Mp=1069.3( 0.0) Tp= 25 A2=855/25 P=YMMPVNSEV 
50 genpept PR°>gi 1 12667401 1 gb IAAK01426 . 1 j AF326731 1 (AF326731) NUF2R 
[Homo sapiens] POS=65 (SEQ ID NO: 166) 

G=3406: Mg=1071.5: Tg^20+-3 #D+(1,1) #FR+[2,1) #G+(20,7) 
S=94(97,90) Mp=1071.6{ 0.1) Tp=> 24 A2=109/30 P-ILDQKINEV 
55 Qenpept PR=>gi | 338278 | gb | AAA60563 , 1 1 (M31061) ornithine 

decarboxylase [Homo sapiens] POS=23 (SEQ ID NO: 17) ref 
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G=3410: Mg=1071.6: Tg=8+-8 #D+(7,1) #F+(6,1) #FR+(6,1) #G+(25,5) 
#K+(4,3) 

S=94(96,90) Mp=1071.7( 0.1) Tp» 12 A2= 53/29 P=ILDKKVEKV 
genpept PR=>gi 1 3867861 gbl AAA36Q2 6, 1 1 (J04986) 90 JcD heat shock 
5 protein [Homo sapiens] POSo570 (SEQ ID NO: 18) ref 

G^3418: Mg=1073.6: Tg=5+-7 #F+{1,1) #G+(3,3) 

3=79(79,81) Mp=1072.5(-l.l) Tp° 5 A2^Q.1/14 P=NKDLKMPKV 
genpept PR«>gi 1 1808578 | dbj I BAA07918 . 1 1 (D44466) proteasome subunit 
10 pll2 [Homo sapiens] POS°792 {SEQ ID NO: 167) 

G-3424: Mg=l074.6: Tg=46+-0 #K+(6,4} 

$=90(90,90) Mp^l074~,4 (-0.2) Tp= 31 A2=201/28 P^NLAEDIMRL 
genpept PROgi | 37852 1 emb ICAA79613 . 1 1 (Z19554) vimentin [Homo 
15 sapiens] POS=177 (SEQ ID NO: 168) 

GK34 27 ; Mg=1075.4: Tg=46+-0 #D+{1<1) #F+(8,6) 

S-72{73,72) Mp=lQ75.6( 0.2) Tp= 44 A2=>lk/31 P=YI J P£LLQTV 
genpept PR->gi 1 12653299 ( gb f AAH00418 . 1 I AAH00418 (BC000418) 

20 ectodermal-neural cortex (with STB-liJce domain) [Homo sapiens] 
POS*=228 (SEQ ID WO: 169) 

O G-3470: Mg=1080.4: Tg-62+-l #D+(13 # 4] #E+(17 r 8) #F+(U,6) 

€| #EST+{2,1> #FR+(6,3) #G+(19,9) #K+(10,5) #L+(1,1) 

63 2 5 S=82{82,82) Mp^l080.6( 0.2) Tp= 69 A2=>lk/27 P-FLYPFPLAL 

fp genpept PR=*>gi 1 436224 1 dbj I BAAC5062 . 1 1 (D26067) KIAA0033 [Homo 

j|| sapiens] POS-185 (SEQ ID NO: 170) 

m 

^ G-3472: Mg-1080.4: Tg=50+-0 #F+[13,7) #G+(25,11) #K+(10,5) 

+; 30 3=76(87,52) Mp«1080.5( 0.1) Tp- 53 A2=182/33 P= S LL P P T AL VGL 
genpept PR=>gi 1 1296664 | emb I CAA65774 . 1 1 (X97064) Sec23 protein [Homo 

s sapiens] POS^156 (SEQ ID NO: 19) 

jfl G=3476: Mg=1080.7: Tg<=41t-1 #FR+(1,1) #G+(5,4| 

s»j 35 3=75(77,72) Mp=1080 . 6 (-0 - 1 ) Tp= 38 A2=>lk/29 P=NLYPFVKTV 

Ji? genpept PR=>gi| 1263196 lgb!AAA974Q5. 1 j (U37436) A I CAR 

!if fonnyltransferase/IMP cyclohydrolase bifunctional enzyme [Homo 

O sapiens] POS=101 (SEQ ID NO; 171) 

40 G=3477: Mg=1081>4: Tg=56+-0 #F+(6,3) 

$=90(87,99) Mp«l081.7( 0.3) Tp- 57 A2~>lk/24 P=SVIEQLFFV 

genpept PR->gi | 3014 0 ! erab | CAA34277 . 1 1 (X16155) COUP-TF [Homo sapiens] 
POS=378 (SEQ ID NO: 172) 

45 G=3478: Mg-1081,4: Tg=56+-0 #G+(4,3) 

S=84 (86,81) Mp=1080.6(-O.8) Tp=> 57 A2=>lJc/29 P-SLLEPFVYL 
ggnpept PR=>gi i 7008404 | gb I AAF34999 . 1 1 (AF229840) kappa B-ras 2 [Homo 
sapiens] FOS°156 (SEQ ID NO: 173) 

50 G=3497: Mg=1084.7: Tg=24+-3 #F+(1,1) #EST+(3,1) #FR+{2,1) 

$=79(82,72) Mp-=1084.6(-0. 1) Tp= 37 A2°437/26 P=ILFGHENRV 
genpept PR=>gi | 5911941 1 emb | CABS 594 6 . 1 1 (AL117471) hypothetical 
protein [Homo sapiens) POS°250 (SEQ ID NO:174)re£ 



55 



G=3505; Mg=1086,5: Tg=19+-3 #G+(1S,6) 
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S=88 (91, 83) Mp=>1086.7( 0.2) Tp= 20 A2<*998/29 P-KLQEVGQVSV 
genpept PR->gi | 34 0307 | gb 1 AAA36808 . 1 1 (M14648) vitronectin alpha 
subunit precursor [Homo sapiens] POS=338 (SEQ ID NO: 175 ) 

5 G=3520: Mg=1088.6: Tg=37+-1 #F+(3,2) #EST+(2,1) #G+(8,5) #K+(5,4) 
S=72(82,49) Mp-1088.5(-0.1) Tp= 37 A2 = 75/16 P°RLFDEPQLA 
genpept PR=>gi I 3334982 | gb | AAC26984 . 1 | AAC26984 (AC005306) R27216J. 
[Homo sapiens] POS=2 (SEQ ID NO: 176) 

10 G=352l; Mg=l088.6: Tg=44+-1 #G+(5,4) 

S=70(71,68) Mp=1088.6( 0.0) Tp= 43 A2~119/30 P°$LFPGKLEW 
genpept PR=>gi I 440177 | gb I AAC03568 , 1 1 (U01184) flightless-I homolog 
[Homo sapiens] POS=1009 (SEQ ID NO:177) 

15 G=3526: Mg=1089.6: Tg=50+-1 #S+(10,1) 

S=84 (91,68) Wp=1089.6( 0.0) Tp~ 57 A2« 37/23 P^VMLGTPFLVI 
genpept PR=>gi I 458 9536 1 dbj I BAA76790 . 1 1 (AB023163) KIAA0946 protein 
[Homo sapiens] POS=340 (SEQ ID NO: 178 ) 

20 G=3533: Mg°1091.4: Tg°15+-2 #G+(12,4) 

S=86(95,68) Mp=1091.4( 0.0) Tp« 13 A2= 80/20 P=GVYDGEEHSV 
genpept PR=>gi I 4102749 |gb| AAD01565. 1 1 (AF015766) MAGE XP-2 protein 

O [Homo sapiens] POS-231 (SEQ ID NO: 20) 

JJjj 25 G=3545: Mg=lQ94,4; Tg^50+-1 ttE+(22,9) #F+<16,7) #EST+(4,2) 

ff* #FR+(4,2) #G+(50,11) #K+(11,6) #L+(2,2) 

£ 3=80(89,59) Mp=1094.5( 0.1) Tp= 49 A2=l82/33 P«SLLPPDALVGt 

Hi genpept PR=>gi 1 13529299 1 gb I AAH05404 . 1 |AAH05404 (BC005404) Unknown 

111 (protein for MGC:5020) [Homo sapiens] PQS=156 (SEQ ID NO: 21) 

4=? 3Q 

03 G°3563: Mg-1098.3: Tg=38+-1 #D+P,3) #E+(10,6) #F+(5,4) #EST+(3,2) 

s #FR+(4,2) #G+(12,8) 

H S=88(88,90) Mp°1098.4( 0.1) Tp= 30 A2=280/26 P=SLYDYNPNL 

Jp genpept PR=>gi 1 3337383 1 gb | AAC27 4 26 . 1 1 (AC002544) Translation 

35 initiation factor elF-pllO [Homo sapiens] POS°381 (SEQ ID NO: 179) 

ru 

ill G=3566: Mg=109B.6: Tg-50+-0 #E+(6,3) ttEST+(3,l) #FR+(2,1) 

p $=92(89,68) M P =1098.7< 0.1) Tp- 62 A2-194/25 P-FLLQPRLVLA 

H> genpept PR=>gi 1 887368 1 gb| AAC4 2003 . 1 1 (L40397) ORF; putative [Homo 

40 sapiens] POS=31 (SEQ ID NO: 180) 

G=3579: Mg=1101.4: Tg=34+-1 #F+(5,4) #G+(2,2) 

5=88(91,83) Mp=ll01.4( 0.0) Tp= 35 A2=502/24 P=FLYTGEGDTV 
qenpept PR=>gi 1 1184320 I gb I AAC50373 . 1 1 (045880) X-linlced inhibitor of 
45 apotosis protein [Homo sapiens] POS=52 (SEQ ID NO: 181) 

G-3588: Mg~1102.6: Tg=34+-1 #£+(1,1) #EST+(4,2) #FR+(2,2) 
5=82(83,81) Mp=U02.6< 0.0) Tp= 27 A2«>lk/26 P=KLNPQQFEV 
qenpept PR=>gi | 624704 1 gb I AAB05994 . 1 | (L38 961) putative transmembrane 
50 protein precursor [Homo sapiens] POS-289 (SEQ ID NO: 182) 

G=3596: Mg=ll03.4: Tg=28+-2 #F+(3,2J #G+(1,1) 

S~88<94,74) MD-1103.4< 0.0) Tp= 23 A2=l40/27 P=SLADLQNDEV 
genpept PR->gi I 854179 | emb I CAA60827 . 1 1 (X87373) ribosomal protein S3a 
55 [Homo sapiens] POS=70 (SEQ ID NO: 183) 

G=3603: Mg=1104.7: Tg=4$+-0 #D+<1,1) #F+{4,3) #EST+(3,1) #FR+(5,2) 
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S°85(99,54) Mp°1104.7( 0,0) Tp° 40 A2°364/28 P^RLLDYWNI 
genpept PR=>gi I 7023768 I dbj I BAA92081 . 1 1 (AK002094) unnamed protein 
product [Homo sapiens] POS=172 (SEQ ID NO: 184) 

G*3629: Mg=>1113-5: Tg°35+-1 #G+{10,6) #K+(10,5) 

S=77(76,81) Mp=1113.6{ 0.1) Tp= 31 A2- 46/21 P=FVDDYTVRV 
genpept FR=>gi 1 1923256 | gb | AAC518 66 . 1 1 (U86782) 26S 

proteasome-associated padl homolog [Homo sapiens] POS=61 {SEQ ID 
NO:185) 



10 



m 



G=3637: Mg=1115.4: Tg=55+-1 #£+(29,8) #F+{14,7) #EST+<2,1) 
#FR+(4,2) #G+(9,5) #L+(1,1) 

S=82 (90, 66) Mp=lll5.5( 0.1) Tp° 61 A2=>lk/29 P=SLFEGTWYL 
genpept PR=>gi j 126530 65 |gb I AAH00297. 1 IAAH0Q2 97 (BC000297) 

15 3-hydroxy-3-methylglutaryl-Cosnzyme A synthase 1 (soluble) [Homo 
sapiens] POS=447 (SEQ ID NO: 186) 

G=3652: Mg°1119.5: Tg=56+-0 #D+(4,3) 

S=80(84,72) Mp-lU9.7( 0.2) Tp= 57 A2=5l2/27 P-ALYNWLIQV 
20 genpept PR°>gi I 3288447 | emb I CAA07553 . 1 1 (AJ007558) nucleoporin 155 
[Homo sapiens] POS=1038 (SEQ 10 NO: 187) 

O G=3653: Mg=1119.6: Tg=30+-1 #D+U,1) #F+(3,2) #EST+(1,1) #FR+(2,1) 

#G+(10,5) 

C(J 25 S=8Q(84,72) Mp=1119.7( 0.1) Tp= 30 A2= 97/25 P=VLIDYQRNV 

genpept PR=>gi I 2626840 | dbj | BAA234 15 . 1 1 (D89729) CRMl protein [Homo 
sapiens] POS«=784 (SEQ ID NO: 188) 

li] G=3658: Mg=1121.3: Tg=49+-0 #S+(9,2) #D+(7,3) #E+(8,5) #F+(14 # 7) 

30 #G+(26,11) #K+(2,2) #L+(l f l) 

OS S-84(S1 # 91J tfp=1121.5< 0.2) Tp= 47 A2=577/24 P=TLWVDPYEV 

B genpept PR=>gi 1 1703501 1 gb I AAB37560 . 1 1 (U72649) BTG2 [Homo sapiens] 

Q POS°101 (SEQ ID NO:22) ref 

«! 35 G=3683: Mg=1128.3: ?g=51+-0 #S #G+<17,9) #K+(2,1) 

W S=86(88,82) Mp=1128.5( 0.2) Tp= 55 A2-348/25 P=FTW5GIpYNV 

*9 genpept PR=>gi 1 1276912 1 gb | AAC50450 . 1 1 (044839) UHX1 protein [Homo 

© sapiens] POS=3S3 (SEQ ID NO: 18 9) 

40 G-3694: Mg=1133.6: Tg=>25+-3 #D+(2,1) #F+(1,D ttG+(7,3) 

S=85<87,81) Mp°1133.7( 0.1) Tp= 30 A2=>l)c/32 P=ILMEHIHKL 
genpept PR=>gi I 298486 I gb | AAB25672 . 1 [ (S56985) ribosomal protein L19 
[human, breast cancer cell line, MCF-7, Peptide, 196 aa] [Homo 
sapiens] P0S=137 (SEQ ID NO:190) ref 

45 

G=3697: Mg~1134.6: Tg-42+-l #E+{12,6) #F+(1,1) #EST+(3,1) 

#FR+(3,1) #G+(9,5) #K+{13,6) #L+(1,1) 

S=81(93,53) Mp=1134.6{ 0.0) Tp= 37 A2-193/26 P-RLDELGGVYL 
genpept PR->gi 1 13374901 1 emb | CAC34517 . 1 1 (AL031659) dJ343K2 .2.3 
50 (ribophorin II (isoform 3)) [Homo sapiens] POS^185 (SEQ ID NO:191) 

G=3711: Mg-1140.6: Tg=40+-1 #EST+{1,1) #FR+(1,1) #G+(1,1) 
S~89(93,82) Mp=1140.7< 0.1) Tpo 40 A2=526/27 P=KI*LSKFY£L 
genpept PR=>gi 1 10439903 | dbj I BAB15591 . 1 1 (AK026930) unnamed protein 
55 product [Homo sapiens] POS^231 (SEQ ID NO: 192) 

G=3721: Mg=1145.4: Tg=49+-1 #3+17,1) #F+(2,2) #G+(14,l0) 
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S=79{83,70) Mp°1145.5( 0.1) Tp= 50 A2=>lk/23 P=FLf DGSPTYV 
genpept PR^>gi 1 1049053 [ gb IAAC50259. 1 1 (U26644) encodes region of 
fatty acid synthase activity; FAS; multifunctional protein [Homo 
sapiens) POS=2329 (SEQ ID N0:23) 

5 

G-3728: Mg°1147.5: TgM8+-l #E+(3,2) #EST+(2,1) #FR+(5,2) #G+(4,3) 
#K+(13,6) 

S°91(92,90) Mp=1147.7( 0.2) Tp= 45 A2->lk/20 P^KVLDFEHFL 
genpept FR=>gi 1 189022 I gb | AAA36348. 1 1 (M22920) smooth muscle mysoin 
10 light chain [Homo sapiens] POS=28 (SEQ ID WO: 193) 

G-3743: Mg=U52.6: Tg=47+-0 #D+(5,3) #f+(l,l) 

S=79(82,72) Mp=1151.6(-1.0) Tp- 43 A2=>lk/24 P=YLPEDFIRV 
genpept PR=>gi j 2653877 | go | AAB87669 . 1 | (AF026273) interleukin-1 
15 receptor-associated kinase-2; IRAK-2 [Homo sapiens] POS^381 (SEQ ID 
«0:194) 

G-3754: Mg=1156.5: Tg=35+-1 #2+13,2) ttK+(8,5) 

$=91(95,83) Mp=llS6*5( 0.0) Tp= 43 A2=403/28 P-FLSEHPNVTL 
20 genpept PR°>gi | 5102831 1 emb I CAB45270 . 1 j (AL022318) bK150C2.2 
(Phorbolin 3) [Homo sapiens] POS=107 (SEQ ID NO: 195) 

t' G=38Q6: Mg=1210.4: Tg=*42+-1 #E+(7,4) #EST+(1,1) #FR+(5,3J 

^ #G+(20,11) #K+(10,6) #L+(4,2) 

CO 25 3=76(80,68) Mp=l210.6( 0.2) Tp= 44 A2=128/21 P - L L L DVPT AAVQ A 

p qrenpept PR^>gi 1 6165618 igb (AAF04 618 . 1 |AF09?362_1 (AF097362) 

jj| gamma-interferon inducible lysosomal thiol reductase [Homo sapiens] 

m POS=26 (SEQ ID NO: 3) ref 

"(5? 8 

5 30 G=3831: Mg-1258.5: Tg^54+-1 #S+{12,2) #E+(l2,6) #F+(12,6) 
® #£ST+(1,1> #FR+(7,3) #G+(2Q,10) #H+(1,1) #K+(10,5) 

S=87(96,68) Mp=l258,6( 0.1) Tp= 58 A2=611/27 P=SXFDGSPTYV1 
Q genpept PR=>gi 1 104 9053 1 gb| AAC50259. 1 1 (U26644) encodes region of 

|j] fatty acid synthase activity; FAS; multifunctional protein [Homo 

fjj 35 sapiens] POS°2329 (SEQ ID NO: 24) 

^ G=3859: Mg°1360.4; Tg-44+-l #£+(3,2) #G+(19,10) 

H S=91(99,7S) Mp=1360.6( 0.2) Tp= 42 A2=>lk/28 P=ALWDI ETGQQTV 

^ genpept PR=>gi [ 306785 I gb| AAA35922 . 1 1 (M1653B) G protein beta subunit 

40 [Homo sapiens] P0S=167 (SEQ ID NO:25) 



DISCUSSION 

Among the thousands of different peptides presented within the context 
of the MHC class-I on cancer cells, only a few may eventually become 
45 candidates for the development of anti-cancer vaccines. The identification of 
such cancer specific peptides depends on sequencing a relatively large number 
of peptides. 



9J 

While reducing the present invention to practice, a novel method was 
developed to identify candidate peptides for the development of anti-cancer 
vaccines. The novel method involves expressing the soluble extra-cellular 
domain of the MHC molecules that are simple to purify and the recovery, from 
them, large amounts of MHC bound peptides ready for identification by 
ESI-MS/MS. 

Purification of the extra-cellular domain of MHC was previously 
achieved by truncating its entire transmembrane and cytoplasmic domains 
[30], by using a non-fiincttonal transmembrane domain such as Q10*> [24] or 
fusing the extra-cellular domains to soluble secreted proteins such as 
antibodies Fc domains [31, 32]. Such sMHC molecules were produced in 
cultured cells of murine [33], human [30, 34] or insect [35] and in bacteria 
[36]. The soluble MHC molecules expressed by the murine or the human cells 
were capable of binding to their cognate TCRs, indicating the presence of 
bound authentic peptides that mediate this interaction [33, 37]. Bound 
peptides recovered from the secreted murine MHC H-2Ld were analyzed by 
Edman sequencing [38], More recently, peptides recovered from the murine 
Q2/Q10b, which is a natural mutation resulting in the formation of soluble and 
secreted MHC molecules, were analyzed by ESI-MS/MS [39]. The results, 
however, were very disappointing as only six peptides were recovered from 50 
liters of culture medium [39], 
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While culture cancer cell lines are invaluable model for cancer research, 
only a limited number of good model lines are available for the study of tumor 
immunology since some of the better model cell lines have rare MHC 
haplotypes or down regulated MHC expression altogether. The introduction of 
5 foreign MHC into such cells in accordance with the teachings of the present 
invention facilitates the use of the desired model cell lines for the search for 
cancer specific MHC bound peptides. The recovery of secreted MHC from the 
growth medium helps to sidestep possible interference by the cell's 
background MHC haplotypes. 

0 

10 The number of peptides identifiable during each ESI-MS/MS run 



m 



U3 



performed in accordance with the present invention was limited by the rate the 

mass spectrometers can switch between measuring the full spectrum to 

ffl 

T performing CID 3 which was about four seconds. Therefore, during a 

IH chromatography of ninety minutes, around a thousand different peptides could 

ru 

iQ 15 be mass measured and fragmented. The elution order of most of the peptides 

0 

^ recovered for MHC of a particular type and resolved in different 

chromatography runs was similar. Therefore, their masses and CID data were 
combined in order to improve their signal-to-noise ratio. 

About one thousand different molecules that are certainly peptides have 
20 been fragmented at least twice in all the different chxomatographs and out of 
these about two hundreds different peptides have been identified at high 
certainty. Most of these peptides were derived from housekeeping proteins 



29-MAY-2001 El: 22 FROM: 



TO: 972 9 7706333 



P. 094 



i 
m 



10 



93 

and only a few were derived from proteins related to cancer. To increase the 
likelihood of identifying more new cancer specific peptides, the total number 
of identified peptides should be farther enlarged. Identification of large 
number of peptides is currently limited by both the availability of sufficient 
5 amounts of peptides, by the capabilities of the mass spectrometers and by the 
non-^completeness of the databanks. With the expected near availability of the 
entire human genome sequence, it is expected that more of the peptides will be 
identifiable, excluding mutant peptides that will still need to be sequenced de 
novo. 

The soluble and secreted MHC molecules described here present 
similar patterns of peptides as do the original cell surface MHC, This 
conclusion emanates from the observation that most of these peptides, posses 
an amino acid sequence that fit the known sequence consensus of HLA-AZ1 
and of B7 (see score columns in Table 8 above). Some of the peptides have 
15 been identified previously as MHC bound peptides and thus indicate the 
validity of the methodology of the present invention. The most significant 
advantage of the use of secreted MHC as a source for peptides for analysis has 
to do with the order of magnitude larger recovery of sMHC molecules and 
therefore peptides per cell over the alternative purification from detergent 
20 solubilized cells and the purified sMHC molecules were free of interfering 
cellular debris and detergents. 
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Direct biochemical analysis of peptides eluted from MHC molecules 
that are recovered from cancer cells, allows unbiased identification of those 
peptides that are actually presented by the MHC. Even though, identifying 
putative MHC bound peptides using computer programs based on the 
5 consensus motifs followed by synthesizing them and testing their 
immunogenicity, bypasses the reliance on expensive and technically 
demanding mass-spectrometry needed for biochemical analysis of MHC bound 
peptides. However, the motif prediction approach is dependent on the 
n availability of well-established consensus for the MHC allele of interest and is 

03 10 hampered by the difficulty of taking into account the processing machinery 

IP 

U1 involved in generating the peptides and transporting them to the MHC [13]. 

ill 

45 Moreover, it was suggested that contaminating protecting groups inadvertently 

IS 

left on the synthetic peptides are very immunogenic and may become the target 

Lj 

jjjj for the activity of the CTLs. The CTLs generated in vitro are often low 

i If 

15 affinity binders and incapable of recognizing the rare peptides actually 
presented by the cancer cells in vivo [10], 

The examples of identified peptides listed in Table 8 above include 
peptides that do not fit the accepted consensus of MHC bound peptides 
presented by the studied MHC haplotypes. Peptides longer than ten amino 

20 acids are not expected to be common among MHC class-I peptides [40, 41). 
However, in this study, few peptides of 1 1 amino acids (pl210 s SEQ ID NO:3 
and p!258, SEQ ID NO:24) and 12 amino acids (p!360, SEQ ID NO;25) long 
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were observed among the identified peptides. The computer programs for 
motif predictions of class-I peptides are not able to predict such peptides as 
their length is outside the consensus [42, 43]. The detection of longer peptides 
among the peptides in the natural mixture suggests that the consensus motif 
5 should possibly be extended to include such outliers. Another interesting 
observation is the relatively abundance of peptides that originated from 
overlapping parts of the proteins with one or two amino acids difference in 
length such as p800 (SEQ ID NO;4) and p913 (SEQ ID NO:5) from p-catenin, 
p pi 145 (SEQ ED NO:23) and P 1258 (SEQ ID NO;24) from fatty acid synthase, 

BO. 

5 10 p898 (SEQ ID NO:l), plOl 1 (SEQ ID NO:2) and pl210 (SEQ ID NO:3) from 

5 

III IP30 (Table 8). Moreover, peptides pl080 (SEQ ID NO:19) and pl094 (SEQ 

in 

*P ID NO;21) are derived from homologous site in two alleles of the same 

L, protein. This observation points to the existence of structural hotspots for 

p j generation of peptides, possibly as a result of heath-shock proteins binding and 

q is protection from complete proteolysis of these regions. Differences in length 
could also result from incomplete trimming of the peptides in the endoplasmic 

reticulum [44, 45], 

Interesting observations are the large similarities between the patterns 
of peptides produced by cell lines of different tissue origin and on the other 
20 hand, the presence of a few peptides that are unique to one type of cancer cells. 
The ability to characterize the similarities and differences between peptide 
patterns of different cell lines and growth conditions and between different 
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HLA haplotypes are among the most important possible uses of the novel 
methodology presented herein. 

The most effective mean to ascertain the identity of the amino acid 
sequences of peptides that were identified by this method is to compare their 
5 retention times, their exact masses and their CID data to those of the 
corresponding synthetic peptides [16, 39, 46, 47]. The sequences of all the 
peptides that were identified at high confidence by searching the databank with 
their mass spectrometry data were shown to be correct when these parameters 
were compared with the corresponding synthetic peptides. 
10 A number of peptides identified here were derived from known tumor 

antigens. Those peptides that attracted the attention as possibly cancer specific 
were chemically synthesized and tested again. The fact that a few of them 
elicited a CTL response in mice may point to their possible immunogenicity in 
human, 

15 Tumor proteins from which identified peptides were derived included 

mucin (MUCl), a well-studied rumor-associated antigen that is up regulated in 
breast and ovarian carcinomas [48]. A number of HLA-A2.1 restricted 
MUCl -derived CTL epitopes were identified by the motif prediction approach 
[26, 49-52]. Peptide p947 (NLTISDVSV, SEQ ID NO:8) identified here from 

20 breast carcinoma cells (MCF-7) is the same peptide that was predicted and 
confirmed to be a HLA-A2 antigen originating from MUCl by Cannon et al. 
[26]. 
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Another peptide derived from a known tumor antigen, was pi 091 (SEQ 
ID NO:20) from the testis-cancer antigen MAGE-B2. It belongs to a group of 
21 known genes that are essentially silent in most normal cells except for testis 
and placental trophobalsts and since different member of the MAGE proteins 

5 are expressed in a variety of tumors* they attracted significant attention as 
cancer vaccine candidates [53-57]. A few peptides were identified so far from 
the MAGE proteins by genetic approach and by predicting their sequence 
based on the known motifs rather than by the biochemical approach [27 s 28, 
58-61] (reviewed in [10]). The identification of the novel MAGE-B2 derived 

io peptide p!091 (GVYDGEEHSV) (SEQ ID NO:20) by the direct biochemical 



yl approach is a very encouraging observation that confirmed the validity of this 

II! 

method for identification of novel tumor specific antigens* Homologous 



peptides from MAGE-A4 and MAGE-A10 proteins were previously identified 
as MHC bound peptides and tested for their immunogenicity (see Figure 4D). 

15 This suggests the existence of a possible hot spot within the MAGE protein for 
processing as MHC bound peptides [27, 28], 

Peptides derived from other proteins that are involved with cancer 
progression and may also serve as candidates for anti-cancer vaccines of 
diagnosis include p913 (SEQ ID NO:5) from p-catenin, which is normally 

20 involved in cellular adhesion, signal transduction and as a transcription 
enhancer with a possible oncogenic role in colorectal cancer. Abnormal high 
amounts of the protein were found in the cytoplasm in cancer cells instead of 
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the intracellular boundary in normal cells and this abnormal behavior was 
correlated with metastasis [62-64). Peptide pi 145 (SEQ ID NO:23) and P 1258 
(SEQ ID NO;24) is derived form fatty acid synthase (FAS), a biosynthetic 
enzyme expressed in liver and lactating breasts and is a marker of poor 
5 prognosis when expressed in colon, prostate, ovarian, breast and endometrial 
cancers. Its significance for cancer is was established by inhibiting it activity, 
which leads to apoptosis in cancer cells [65-69]. The enzyme DNA methyl 
transferase (MTDM) is the source protein for pl028 (SEQ ID NO:13) an 
q etrayme that is highly expressed in different cancer cell types, including 

Hi io prostate and breast [70-72], Increased MTDM activity is usually associated 

with tumor progression and is considered to be an important event in cell 
transformation [71 s 73]. 

Once tumor specific MHC bound peptides are identified and their 
ability to stimulate an immune response is demonstrated, such peptides 
15 become candidates for adoptive immunotherapy. Identification of peptides 
originating from normal proteins that are uniquely expressed in non-vital 
organs, such as breast, prostate and ovaries can become very useful for 
immunotherapy of these cancers. The potential usefulness of identified 
immunogenic peptides should be evaluated by the presence of specific T cells 
20 directed against them in patients inflicted with the particular cancer using 
standard assays such as ELISPOT and CTL. The assay of immunizing mice 
with the peptides described herein was meant to serve first as validation that 
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these peptides are indeed MHC bound peptides with affinity for the HLA-A2.1 
and as the preliminary indication of their immunogenic potential. 

Secreted soluble MHC such as described herein can also be used for 
analysis of peptides presented by cells involved with pathologies other than 
5 cancer, such as autoimmune diseases and viral infections with the aims of 
identifying peptides of significance for treating these diseases. The method 
can also be used for identifications of MHC bound peptides presented on 
normal cells of specific tissues, peptides presented by particular MHC alleles 
p and peptides originating from expression of particular proteins of interest. 

io Moreover, the approach can be used for analysis of MHC bound peptides 
derived from over-expression of specific proteins, from induced mutations, as 
a result of metastasis progression and as a way for searching for peptides 

derived from signal peptides of cell surface proteins. The approach described 

Lnj 

pi in this study is also useful for comparisons between patterns of MHC bound 

• #i 

q 15 peptides induced by minor changes in the cells growth conditions such as the 

addition of hormones, the expression of a foreign protein or under stress 
conditions. 

Therefore, an appealing outcome of the methodology described herein 
is that the simple expression of different recombinant MHC molecules in 
20 different cell lines in a soluble, secreted form and their easy recovery from the 
growth medium with their peptides still attached, followed by comprehensive 
analysis of the peptides may become a good staging point for above listed 
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ambitious research projects. Such 'human MHC-peptide projects' are worthy 
goals to follow the human genome and proteome projects. 

It is appreciated that certain features of the invention, which are, for 
5 clarity, described in the context of separate embodiments, may also be 
provided in combination in a single embodiment. Conversely, various features 
of the invention, which are, for brevity, described in the context of a single 
embodiment, may also be provided separately or in any suitable 



P subcombination. 



10 



Although the invention has been described in conjunction with specific 



|| embodiments thereof, it is evident that many alternatives, modifications and 

p variations will be apparent to those skilled in the art. Accordingly, it is 

til 

oj intended to embrace all such alternatives, modifications and variations that fall 

P is within the spirit and broad scope of the appended claims. All publications, 

patents and patent applications mentioned in this specification are herein 
incorporated in their entirety by reference into the specification, to the same 
extent as if each individual publication, patent or patent application was 
specifically and individually indicated to be incorporated herein by reference. 
20 In addition, citation or identification of any reference in this application shall 
not be construed as an admission that such reference is available as prior art to 
the present invention. 
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